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Introduction 


This book focuses on three topics: general programming 
concepts and practices, 
the 8086 microprocessor 
with its assembly language, and logic design using the 8086 
microprocessor. 
The discussion of general programming 
concepts and practices is rele- 
vant to any microprocessor, 
but the rest of the book is specific to the 8086. As such, this 
book becomes a how-to text for the 8086. 
The prime source for the 8086 microprocessor 
is: 


INTEL CORPORATION 
3065 Bowers Avenue 
Santa Clara, California 
9505 I 


MOSTEK, INC. 


1215 West Crosby Road 
Carrollton, 
Texas 
75006 


The discussion of general programming 
concepts and practices begins by looking 
at the relationship 
between the programmer 
and a computer, 
since this is ultimately 
what determines 
the nature of any design project. Why do some programmers 
work with 
machine language while others program in assembly language or perhaps higher level 
languages? 
Different types of applications call for different types of programming. 
In 
each case good programming practices should be cultivated. A set of rules is described to 
achieve this goal, and two examples are used to illustrate programming 
projects. 


The description 
of the 8086 microprocessor 
itself covers 
assembly 
language 
programming 
and hardware design. 


For the assembly 
language 
programmer, 
the 8086 CPU architecture 
and the 


microprocessor's 
assembly language instruction 
set are described in detail. 


For the hardware designer, timing and bus considerations 
are described for all sig- 
nals normally input to the microprocessor 
or output 
by it. Single-bus and multi-bus 
architectures 
are covered. The standard Intel Multibus is described in detail. 


This book assumes that you have a working knowledge of general microprocessor 
concepts, 
and the ideas presented 
in An Introduction to Microcomputers: Volume I - 
Basic Concepts, 2nd Revision, 
by A. Osborne, 
Osborne/McGraw-Hili, 
1980. Accor- 
dingly, this book does not cover any elementary 
material such as binary arithmetic, 
buffers, or CPU architecture 
fundamentals. 
The 8086 microprocessor 
and its immediate 
support parts are described in great 
detail within this book. The 8089 I/O Processor is referred to, but it is not described in 
detail. For a detailed description of this part, see The 8089110 Processor Handbook, by A. 
Osborne, 
Osborne/McGraw-Hili, 
1980. 
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Programming 


What is the function of assembly language in a microcomputer 
system? How does 


it differ from machine language or higher level language programming? 
This chapter 


will answer these questions by assessing the various roles that assembly language plays. 


In a very general sense, all microcomputer 
systems take the following form: 


Input Lines 
~ 
T_h_e__ L 
Output 
Lines 
I 
System I~ 


where the input lines are used to provide information 
to the system, and the output lines 


are used to transmit information 
from the system. Generally, the system consists of the 
following: 


Program 
Memory 


The Central Processing Unit (CPU) takes data, through the I/O interface, from the 
input lines; the CPU manipulates 
this data by executing 
instructions 
from its program 
memory. Results are output via the output lines. The CPU stores transient data in the 
data memory. 


The CPU, the I/O interface, and the physical memories are the hardware portions 
of the system. The data that resides in the program memory is the software portion of 
the system or the program. 
Elements of the 8086 assembly language are combined 
to 
form an 8086 assembly language program, which is processed and stored in the program 
memory. Thus, the assembly language is used to specify the program that resides in the 
program memory. 


To understand 
the concept of a program, consider an elementary point-of-sale ter- 
minal that has the following components: 


Keyboard-----..I__c_alc_u_la_tin_9_L 
Display 
I 
Chip I 


As keystrokes 
are entered, 
the calculating chip performs operations 
that convert 
each keystroke into a machine-acceptable 
code. The code could represent a number to 
be manipulated 
or a calculation to be performed. By interpreting 
the codes, the calculat- 
ing chip performs required operations 
and indicates the results on the display. 


The calculating chip accomplishes these operations 
by performing a sequence 
of 
tasks. For example, the calculating chip might perform the following sequence of tasks 
to determine 
whether or not a key has been pressed. 


1. 
Read in keyboard status byte 


2. 
Extract bit 3 from status byte 


(If bit 3 is 0, no key has been depressed. 
If bit 3 is 1, a key has been 
depressed.) 


3. 
Test bit 3 
(If bit 3 is 0, return to step 1. If bit 3 is 1, proceed to step 4.) 


4. 
Perform the next task. This might be a command 
to clear the key depressed 
bit or a command 
to disable the keyboard. 


The complete set of tasks performed by the calculating chip, which would include 
all the translation and calculation operations, 
is known as the algorithm. An algorithm is 
composed of an ordered sequence of well-defined tasks that has a starting point and a 
criterion 
for stopping. 
Algorithms 
are usually expressed 
in the form of the example 
above, that is, English sentences that describe the tasks to be performed. Unfortunately, 
the CPU cannot respond to English sentences 
like "Read 
in Keyboard Status Byte." 
There must be a translation 
from the algorithm, 
which is composed 
of English sen- 
tences, into a form interpretable 
by the CPU, which will consist of a sequence of binary 
CPU instructions. 
The set ofCPU instructions that is used to implement an algorithm is 
known as the object program. 


The CPU executes instructions 
by analyzing units of information 
that consist of 
binary digits, namely digits that are either 1 or O.Simple CPUs have two cycles, instruc- 
tion fetch and instruction 
execute. In the instruction 
fetch cycle, the CPU generates the 
address of the location that contains the next instruction 
(unit of information) 
to be 
executed; 
the CPU requests that the memory provide it with the unit of information 
at 
that location. The memory produces the appropriate 
information. 
During the ensuing 
instruction execute cycle, the CPU analyzes the information 
and performs the appropri- 
ate action. 
For example, assume the following data is present in an Intel 8086 system in order 
to perform the tasks shown in the previous example (Addresses and Instructions 
in bin- 
ary). 


0000 
0001 
0010 
0011 
0100 
0101 


11100100 
00001010 
00100100 
00001000 
01110101 
11111010 


If the 8086 begins executing at location 0000, it will read in the first instruction, 
located at 0000, and analyze it. The CPU determines 
that the instruction 
is an input 
instruction and that the next location, 0001, contains the device address from which the 
data should be read. Therefore the device code is 000010 10. If the device at device code 
00001010 produces the keyboard status byte, executing this 8086 instruction 
will read 
the keyboard status byte into the 8086's AL register. After executing the instruction 
at 
0000, the next instruction 
executed will be the instruction 
at 0010. The instruction 
at 
0010 uses information at 0011 to perform an AND with AL. This extracts bitJ, as in the 
second task in the previous 
example. 
The instruction 
at 0100 and 0101 determines 
whether bit 3 is 1 or 0, then takes appropriate 
action. 
The CPU operates with Is and Os. People, however, 
are not as adept at using Is 
and Os. Therefore 
an intermediate 
step is provided between the CPU's 
Is and as and 
people. This step is assembly language. Instead of directly entering Is and Osto the com- 
puter, people write programs in assembly language. Assembly language programs are 
converted to the appropriate 
Is and Osby a program known as the assembler. 
The user's 
program written in assembly language is known as the source program. 
For example, instead of creating a program out of the Is and Os, as shown above, 
these lines of 8086 assembly code (source code) could be input to the assembler: 


IN 
AND 
JNZ 


AL,OAH 
AL,08H 
TOP 


The assembler converts the code to the Is and Os (object code) of the previous example. 
Assembly language consists of a set of instructions 
that can be converted 
by the 
assembler 
into all the combinations 
of Is and Os that are executable 
by the system. 


For example, the 8086 assembly language instruction: 


This object code is interpreted 
by the 8086 CPU as an instruction 
to AND the contents 
of the AL register with the word 00001000. 


For several reasons, assembly language programming 
will be more efficient than 
programming 
in binary code. First, it is clearly easier to write assembly code using 
assembly language instructions 
like AND, ADD, or XOR, rather than writing instruc- 
tions like 01001000, 
10100010, or 01110000. Second, the possibility for errors when 
entering 
CPU language 
instructions 
is unreasonably 
high. When 
writing assembly 
language, if errors are made, they will usually be caught by the assembler. 


Now consider the relationship 
between the programmer 
and the microcomputer 
system. To make a microcomputer 
system work, these are the tasks that programmers 
commonly 
perform: 


1. 
Specification of the system. A specification includes a general discussion of all 
the functions that the system will provide, plus a description of the character 
of the inputs and outputs the system will handle. 


2. 
Design of a computer 
program that implements 
the specification on a given 
system. This requires that the specification be translated into a series of steps 
that will allow the proposed system to cope with the particular application. 


3. 
Implementation 
of the program design using a particular computer language. 


This phase contains three separate tasks: coding, debugging, and integration. 


4. 
Testing of the complete system. Sets of test data are input to the system. The 
test data is designed to exercise program logic and hardware components. 


5. 
Documentation 
of the system. Adequate 
documentation 
requires a descrip- 
tion of how the entire system works, an operator's 
guide to the system, and 
complete documentation 
of the programs. 


6. 
Maintenance 
of the system. A plan must exist for updating the system should 
new requirements 
or new equipment 
be necessary. 


The above list is always used when programming 
any complex system. There 
exist, however, 
a limited number 
of cases where a 250-page specification, 
including 
three subsections 
on system expansion, a 50-page operator's 
guide, and a rigorous test- 
ing procedure are not necessary. These programs would appear in the following kinds of 
situations: 


1. 
A serial I/O channel has failed. The hardware person is pointing a finger at the 
software, the software person finds it impossible to believe that such a squalid 
piece of hardware is not at fault. Hopefully, the solution is a short program 
that initializes the channel, 
then stands around 
reading and displaying data 
every time the serial I/O channel 
indicates data is available. 
It should 
be 
relatively easy to establish whether the hardware is working; if the hardware is 
working, there can be little doubt about what is not working. 


2. 
A small number 
of non-trivial 
calculations 
need to be made. Fortunately, 
a 
FORTRAN 
system 
is available. 
Hopefully, 
the solution 
is a 20-statement 


FORTRAN 
program that will produce the desired results. 


In both of the above cases, very little specification or program design is committed 
to paper; these steps are performed 
in the programmer's 
head. No documentation 
is 
likely to be produced in these cases, and it is doubtful that maintenance 
will be necess- 
ary. It would be very wise, however, to remem ber that these cases are the exception to 
the rule. 


The preceding list of tasks is used very effectively in environments 
where there 
are multiple 
numbers 
of programmers. 
Some programmers 
execute 
steps 
1 and 2 


exclusively, some programmers 
only perform the implementation 
process, some dedi- 


cate most of their time to testing program systems, 
others 
restrict their activities to 
documentation 
and/or 
maintenance, 
and still others perform some abstruse 
combina- 
tion of tasks. In this way, programmers 
develop specialized skills that may allow for 


greater 
productivity. 
In most assembly 
language 
scenarios, 
however, 
the assembly 


language 
programmer 
is called upon 
to execute 
all of the above tasks. This book 
emphasizes an assembly language approach to the 8086, so a general discussion of all of 
these tasks follows. 


When the acquisition of a microcomputer 
system is initially considered, 
it results 
from one of two kinds of analysis: 


I. 
There is a specific problem that needs to be faced. For example, an aerospace 
manufacturer 
is producing 
a missile guidance 
system which needs an on- 
board computing 
system meeting certain size and speed requirements. 


2. 
There is a specific market for a new microcomputer 
system. 
For example, 
small businesses 
that could not previously afford computerized 
accounting 
will buy when microcomputer-based 
business systems prices fall low enough. 


In either case, it is important 
to specify the exact function that the contemplated 
system will perform. In the first case, the nature of the problem will probably limit the 
system to performing a specific function. In the second case, it is easy to get carried away 
with the specification. 
For a small business system it is necessary to define precisely 
what accounting functions will be performed and exactly how many records of various 
types will be allowed in the system. 
Otherwise 
the microcomputer 
system could be 
assigned more tasks than the hardware has the capacity to handle. 
Referring back to our simplistic model of a microcomputer 
system earlier in this 
chapter, the specification will define the following: 


The inputs received by the system 


The computation 
performed 
by the system 


The outputs created by the system 


The specification of a microcomputer 
system's 
inputs depends a great deal on the 
level of programming 
being performed. An applications programmer, 
writing in BASIC, 
is very unlikely to be concerned with the type of commands that are given to a disk con- 
troller; rather, he will be concerned 
with the type of data on the disk, how records are 
laid out in a disk file, how the operating system constricts his manipulation 
of the disk 
file, etc. Since this book concerns itself with assembly language programming, 
and since 
any reasonable 
discussion 
of data base manipulation 
techniques 
would be beyond the 
scope of this book, we will emphasize the specification of the inputs and outputs at the 
hardware level. 


At the hardware 
level, three parameters 
define the characteristics 
of an input 
channel. They are: 


1. 
The data path width. Input may arrive one bit at a time from a processor con- 
troller error system. A parallel or serial I/O channel will input eight bits at a 
time. A floppy disk controller may transmit 
1024 bits (128 bytes) of informa- 
tion upon request. 


2. 
Data transfer 
speed and type (synchronous 
or asynchronous). 
Data may 
arrive every 200 microseconds 
from a real time clock. A serial 1/0 channel 
may input data asynchronously, 
every 10 milliseconds. An AID converter in a 
control system may transfer data at an undetermined 
rate, but not faster than 
once every 500 milliseconds. 


3. 
Accompanying 
control information. 
A floppy disk may generate an interrupt 
when data is available. A keyboard subsystem may set a status bit when data is 
available. An AID converter may require that the system read input data and 
compare it with prior data to determine 
if new data is available. 


After these parameters have been accounted for, it is important to specify how the 
input channels will be implemented. 
An input channel usually has these three types of ports: 


1. 
Data ports. These ports contain the data that will be passed to the processing 
section of the system. 


2. 
Status ports. These 
ports contain 
information 
that indicates 
when data is 
available, 
whether 
or not errors have occurred 
at this channel, 
and other 
information 
concerning 
the outside world. 


3. 
Control ports. These ports are typically used to initialize the channel's 
mode 
of operation and to control the way in which the channel represents 
itself to 
the outside world. 


All three ports are not always present. In some cases, only the data port is present; 
in some cases, the channel is automatically initialized when power is applied, therefore 
rendering the control port unnecessary. 


-J~--~----- 


When specifying the computation 
section for a microcomputer 
system, there are 
three major areas of concern: 


1. 
Processing 
raw data from the input section. 
This can take the form of a 
translation into a code more readily usable by the system (e.g., from ASCII to 
binary), separating a block of data into its component 
parts (e.g., a sector of 
data from a diskette into the file header, 
header checksum, 
data, and data 
checksum). 


2. 
The actual algorithm implemented 
by the system. While a complete descrip- 
tion of the actual algorithm is usually formed in the Program Design, this part 
of the specification should list all the major functions the system will perform. 


3. 
Processing data for the output section. This processing may include transla- 
tion of data to a form usable by the output 
devices 
(for example, translation 
of binary data into EBCDIC). 


Specification of to a microcomputer 
system's 
outputs 
requires an analysis very 
similar to the one performed for the input section. There are three major parameters 
for 
each output channel: 


1. 
The number 
of bits to be transmitted 
by the channel. 


2. 
Data transfer speed at the output channel. 


3. 
Accompanying control information that tells the system when the transmitter 
is demanding 
more data, or is available to handle more data. 


After these parameters have been accounted for, it is necessary to specify how the 
channel will be controlled. 
As with input channels, 
output channels usually have three 
ports of importance: 


1. 
Data ports. These 
ports receive the data to be transmitted 
to the outside 
world. 


2. 
Status ports. These ports contain information 
that indicates when data may be 
transmitted 
to the data ports, whether 
or not errors have occurred 
in the 
channel, and other information 
about the outside world. 


3. 
Control ports. These ports are typically used to initialize the channel's 
mode 
of operation and to control the way in which the channel presents itself to the 
outside world. 


As with input channels, all of these ports may not be necessary to control an out- 
put channel. 


While performing 
the specification process for each of the three major sections, 


there are a number 
of useful techniques 
to remember: 


1. 
In each section, make a list of the possible error conditions that could occur 
and the system's 
response to the error. 


2. 
In each section, make a list of all functions that the section is to handle; e.g., 
make a list of all the input channels, 
all the computational 
functions, 
and all 


the output channels. Upon completion of a particular section, cross-check the 
section with your list, hopefully ensuring that all possibilities for the system 
have been recognized. 


The first specification 
written is not necessarily the last; unless the problem 
at 
hand is fairly simple, it will almost certainly not be the last. The Program Design task 
and the Implementation 
task may reveal that certain functions 
cannot be performed 


given the selected hardware configuration. 
In this event, it may be necessary to modify 


the specification so that the hardware configuration 
is changed or the offending func- 
tions are modified so that the given hardware configuration 
can accomplish them. 


Program 
Design involves 
taking the words in the specification 
and wntmg 
a 
sequence 
of English language steps that describe the method 
that will implement 
the 
specification. Ideally, these English language steps will provide a clear, simple descrip- 
tion of what the system 
will accomplish. 
At this point, 
it may not be immediately 


obvious that a simple description 
is available for all systems. For example, one would 


not expect to find a concise description of IBM's DOS/VS operating system. While this 
may be true for the entire system, in the ideal situation a simple description should be 
available for each individual 
part of the system 
(for example, 
a printer driver or a 


multiword subtraction 
routine). 
When considering 
the number of parts in a very large 


system, one gets a glimmer of the program designer's task: breaking a large specification 
into a very large number 
of much smaller modules. 
While engaged in the Program Design task, keep these suggestions 
in mind: 


1. 
In the future the program may have to expand to provide more capabilities. 
Therefore 
the 
program 
should 
have 
built-in 
expansion 
facilities. 
Such 
facilities would include system subroutines, 
expandable 
tables and lists of 
data, a convenient, 
well-documented 
method 
for adding more functions 
to 


the system, and data structures 
that are reasonably flexible. 


2. 
In a typical design, there is more than one method 
for accomplishing 
any 
given function. In some cases, limitations of the machine force one solution 
to be used. In other cases, time constraints force another solution. Since these 
factors, namely machine and time limitations, 
may not be well-known until 
the Implementation 
task, where the actual coding takes place, it is often wise 
to pursue alternate methods of solving a particular problem during the design 
stage. The benefits 
of finding alternatives 
at this stage are twofold: first, 
should the cited limitations 
prevent one solution, 
the other will already be 
available, and second, you may discover a more efficient solution in the pro- 
cess. 


3. 
During the design, 
it is very important 
to specify what effect a particular 
module will have on other modules, 
and equally important 
to specify what 
effect other 
modules 
will have 
on that 
module. 
This 
interface 
between 
modules 
becomes 
important 
when 
debugging 
and 
integrating 
program 
modules. 


When Program 
Design is complete, 
use the design to review the specification. 


Cross-checking 
the design with the specification may reveal flaws or omissions in the 
design and/or the specification. Be aware of the fact that the design should be reviewed 
on a regular basis. While the Implementation 
and Testing tasks are being performed, 


new information 
could become available that may force a reevaluation 
of the Program 
Design. 


The 
Implementation 
task consists 
of taking 
the 
English 
language 
algorithm 
specified in the Program Design task and making it work on a specific microcomputer 
system. There are two distinct efforts that go into the Implementation 
task: 


I. 
Coding. This is the process of converting 
the English language steps created 
during the Program Design task into a particular computer 
language. 


2. 
Debugging 
and Integrating. 
This is the process of removing 
errors 
from 
Program Design modules that have been converted 
by the coding phase into 
computer 
language, then integrating these modules into a working system. 


The conversion of the Program Design into a particular computer language can be 
one of a programmer's 
easier tasks. If the Program Design function has been done cor- 
rectly, each separate module will be described by a set of concise English language state- 
ments. Keep the following suggestions 
in mind while coding: 


I. 
Try to use standard subroutines 
or programs whenever possible. Subroutines 
are very useful 
in that they can usually be debugged 
individually. 
After 
removing the bugs from the subroutines 
it is much easier to debug the main 
line of code. In addition, standard subroutines 
make it much easier to add new 
features to a system. 


2. 
Document 
the code as clearly as possible. In addition to comment statements 
which describe 
individual 
modules 
or sections 
of code, labels which have 
mnemonic 
significance are of tremendous 
value. Some assemblers 
limit the 
opportunity 
to do this, as they restrict the number of characters in a label to 
six or fewer. 
In most 
cases, 
however, 
the 
ability to give extraordinary 
mnemonic 
value to the labels for both program and data areas is present and 
should be exercised to the fullest. 


After each of the program design modules has been translated into the appropriate 
computer 
language, a series of checks should be made to ensure that the Coding task 


has been performed 
correctly and to avoid potential difficulties while performing 
the 


Debugging task. This series of checks, sometimes 
referred to as desk checking, is part of 
the Coding procedure, 
but also shares many elements 
with the Debugging task. 


Checks that should be made include: 


1. 
Ensure that the code contains all of the program design modules. 


2. 
Ensure that all decisions included in the program design are included in the 
code. Check the logic at all of the decision points to ensure that the branches 
will be performed 
in the correct manner. 


3. 
Ensure 
that each program design module 
has been provided 
with enough 
information 
to allow it to run correctly. This check can be performed for each 
module by determining 
what this module expects other modules to supply in 
terms of: 


The contents 
of the registers 
The contents 
of data structures 
The state of I/O devices used by this module 
Status settings 


4. 
Ensure that each module provides subsequent 
modules with the correct infor- 
mation. This check can be performed 
for each module by determining 
what 
this module must supply to other modules in terms of: 


The contents 
of the registers 
The contents 
of data structures 
The state of I/O devices used by subsequent 
modules 
Status settings 


5. 
Ensure that code has been entered into this module to handle the following 
situations: 


Errors 
Special cases 
Boundary cases 
Trivial cases 


The Debugging and Integration 
task consists of removing errors from the code 
and then integrating debugged modules into a final working system. The functions per- 
formed during the Debugging task are very similar to the functions performed 
during 
desk checking. 
The Debugging 
task is different 
in that the task is performed 
while 
examining code that is running 
on the system hardware or a simulator 
for the system 
hardware. There are a series of tools used in the debugging process that are not available 
while desk checking. These tools are usually, but not always, provided by a software 
module called the Debugger. Typical features provided by a Debugger include: 


A Single Step facility. This facility allows a user to execute individual instruc- 
tions following the program logic. 


Examine/ Alter the contents 
of a memory 
location or a register. This facility 
allows the user to view memory/register 
contents 
and optionally alter them. 


A Breakpoint facility. This facility allows the user to interrupt the execution of 
the program which is being debugged, depending upon some condition. Typi- 
cal breakpoint 
conditions include reference to a particular address, for either 
operand reference or instruction 
fetch. 


When debugging a program, the following suggestions 
should be kept in mind: 


1. 
Begin the 
debugging 
process 
by debugging 
commonly 
used 
or system 
subroutines. 
If the lowest-level routines in a software system are known to be 
functioning appropriately, discovering the source of an error is simplified, as it 
can be assumed 
that either the mainline code is in error or it is using the 
system subroutines 
in an incorrect fashion. 


2. 
If it is possible, attempt to debug each area of the specification individually. It 
is appropriate 
to debug each section of the Input portion of the specification 
individually, 
followed by each section 
of the Computation 
portion 
of the 
specification, followed by each section of the Output portion of the specifica- 
tion. When sections of the specification are debugged individually, it is possi- 
ble to view each section without 
interference 
from other 
portions 
of the 
system. 
Theoretically, 
when all of the individual 
modules 
have been de- 
bugged, the Integration 
phase will only need to debug the way in which the 
program modules interface to each other. 


When all of the individual modules have been debugged, 
the Integration 
phase 
begins. In this phase, 
individual 
modules 
are combined 
into a subsystem 
and then 
debugged as a subsystem. 
For example, all the program design modules which affect the 
input portion of the program can be combined 
and debugged. 
As each subsystem 
is 
debugged, 
it can then be combined 
with other subsyStems 
until the final system is 
debugged. As noted above, the only function that should be performed by the Integra- 
tion phase is to ensure that the interface between modules (and eventually subsystems) 
is handled correctly. 


At anyone 
of the stages of Implementation, 
it may be necessary to return to the 
Program Design or even the Specification task. Consider these examples: 


1. 
During the Coding stage, it becomes obvious that the code necessary to pro- 
vide the specified functions will require more memory than has been provided 
for in the hardware design. First, return to the Program Design task to deter- 
mine if alternate 
methods would allow for the use of less memory space. If 
this doesn't 
solve the problem, it is time to go back to the Specification task 
and reconfigure 
the system in some fashion. 


2. 
During the Debugging 
phase, it is noticed that the system cannot respond 
quickly enough if an attempt is made to run all the devices the system is sup- 
posed to control. Note that this difficulty might not be obvious in the early 
stages of Debugging, since the input and output portions of the specification 
will typically be debugged separately until Integration 
is performed. 
In this 
case, return to the Coding task to see if the execution 
time for the input or 
output code can be reduced. If this fails, return to the Program Design task to 
determine 
if a more efficient algorithm is available. If, horror of horrors, 
this 
fails, return to the Specification stage to revamp the system. 


The Testing task consists 
of thoroughly 
exerCISing the system by introducing 
special sets of data and verifying that the correct results are produced. This task is very 
common in environments 
where there are large numbers 
of programmers. 
For exam- 
ple, before any self-respecting 
software house releases a new version of an operating 
system, the new system will have been thoroughly 
scrutinized. 
Before an automobile 
manufacturer 
releases a version of an on-board 
computer 
system, 
rigorous tests will 
have been performed. 
But testing is often overlooked in situations where few assembly 
language programmers 
exist. A major reason for the neglect of testing is that it can be 
very time consuming 
and therefore 
very expensive; 
in addition, it is not a well under- 
stood art. 


Hopefully a significant portion of the Testing task can be accomplished during the 
debugging phase of the Implementation 
task. During the debugging section, for exam- 
ple, each module will be tested by executing the module using data that results in boun- 
dary conditions, thereby exercising the module's 
ability to make decisions. As an exam- 
ple, suppose a module is coded to perform one function if the first byte of a data block is 
in the range 3016 to 3916 inclusive, another function if the first byte is in the range 411( to 
4616 inclusive, and a third function if the byte is not in either of the ranges. Typical test 
data might well include blocks with a first byte of: 


These blocks would test the system's ability to distinguish between different types 
of data. 


When deciding upon test data to submit to the system, keep the following sugges- 
tions in mind: 


1. 
Three basic types of data to enter are: 


The typical stream of data that the system normally would encounter 


A series of boundary conditions that exercise the system's 
ability to per- 
form decisions correctly 


A random 
selection 
of data containing 
both legitimate and illegitimate 
data 


2. 
The data should be presented to the system at the following speeds: 


The typical data rate that the system would normally encounter 


The fastest data rate at which the system is supposed to function 


A random selection of data rates 


The Documentation 
task consists of writing down all information 
pertinent to the 


system. There are three basic elements 
in the documentation 
of a system: 


1. 
Documentation 
of the program. As was noted in the discussion of the Imple- 
mentation task, it is very important to explain how the code works, module by 
module, and in some cases why the code works the way it does. This sort of 
documentation 
allows new readers of the code to easily familiarize themselves 
with the code. In addition, if they desire to alter the code, good documenta- 
tion may allow reasonable, 
informed decisions on how the alteration should 
be made. Program documentation 
helps refresh the memory of the original 


coder, who may be examining a program written in the distant past. 


2. 
A System Guide. 
The System Guide 
should 
include a description 
of the 


program's 
design, a description 
of how to modify the program, 
and a brief 


summary of what the system expects to see in the outside world; i.e., what is 
driving the input lines and what is receiving the data on the output 
lines. 
Hopefully, a System Guide will be fairly simple to put together, as the major 
elements should have been written up, at least in note form, during the pre- 
vious tasks. 


3. 
A User's Guide. This may be the most important 
piece of documentation. 
If 
the code is very well documented, 
and if a superlative System Guide has been 
written, 
then 
other 
programmers 
may be able to modify or improve 
the 


program; but if there is no User's Guide, it may be impossible for anyone to 
use the program, 
in which case no one will care to modify or improve 
the 


code, and all the effort will have been wasted. User's Guides are especially 
important 
in systems where external users may write programs that interface 


with the system. 
In this case, it is of extreme 
importance 
to notify users, 
typically through an update of the User's Guide, of any revisions or additions 
to the system. 


The Maintenance 
task consists 
of altering 
the program 
to accommodate 
new 
equipment 
or new processing requirements; 
in essence, keeping the program function- 


ing in a changing 
environment. 
The Maintenance 
task can be simple or complex, 


depending on the change in the environment. 
Examples of simple tasks include: 


1. 
A new piece of hardware is being installed in the system, replacing an out- 
moded piece of equipment. 
The I/O interface is remarkably similar to the old 


hardware; 
in fact, the change involves only the transposition 
of a few status 


lines. In this case, the Maintenance 
task would involve altering a few lines of 
code in the program, debugging the code using the new hardware, and writing 
the appropriate 
additions to the documentation. 


2. 
System output is going to a diskette file which will, at some later time, be pro- 
cessed by a more sophisticated system. A new operating system release on the 
more sophisticated 
system requires that two previously unused bytes in a dis- 


kette file be used for some more significant purpose. Since this possibility was 
considered 
during the Program Design phase, the Maintenance 
task in this 
case will only require minor alterations 
to the Computation 
section of the 
Specification task and the associated Program Design, Implementation, 
and 
Documentation 
tasks. 


More complex tasks might include: 


1. 
A new piece of hardware is being added to the system. In contrast to the pre- 
vious example, 
this equipment 
is nothing 
like other system equipment; 
in 
fact, it makes new demands 
on the interrupt 
structure, 
the timing of the 
system, and the processing abilities of the system. In this case, the Mainte- 
nance task may require extensive effort in each one of the programming tasks 
that has been identified. 


2. 
The marketing 
department 
has decided to attach a veritable phalanx of 80- 
megabyte disk drives to your microprocessor 
based system. The Maintenance 
task in this case would probably include all of the programming 
tasks from 
Specification to Documentation, 
or conceivably 
an orderly removal 
of the 
appropriate 
marketing 
personnel 
from the gene pool. 


The ability to perform the Maintenance 
task with relative ease is directly propor- 
tional to the care which was taken in the Program Design and Implementation 
stages. If 
the Program Design stage left no easy way to add features or provided no general system 
modules, then additions to the system will probably prove difficult at best. If, during the 
Implementation 
process, no reasonable 
documentation 
on the hows and whys of the 
program were provided, introducing new elements of the Program Design into the code 
will be very tortuous 
indeed. 
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Some Program Examples 


Consider the specification and program design tasks for a sort program module. 


This program reads data records from a file on a tape drive, sorts keys extracted from the 
records, then writes a key file to the tape following the data file. The actual code for this 
program is presented 
in Chapter 6. 
In this exmple, 
a very simple I/O interface 
will be assumed. 
A general block 
diagram for the I/O interface is shown here: 
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The tape controller 
transfers 
I28-byte blocks of data to and from the tape. The 
controller adds parity bits (for a 9-track tape) and a checksum to a block written to the 
tape. The controller processes this error-detecting 
information 
when a read operation is 
performed 
and sets error bits accordingly. 


Transfers to and from the tape are performed 
as follows: 


I. 
The system requests a read or write operation. 


2. 
The system waits for the controller to be ready to transfer a byte of data. 


3. 
The system transfers a byte to/from 
the controller's 
data port. 


4. 
If 128 bytes have been transferred, 
the tape controller 
sets a flag indicating 
that the entire block has been transmitted. 
If the transfer is not complete, the 
system returns to step 2. 


The tape controller will be run with a very simple command structure. The follow- 
ing command byte is sent to the tape controller's 
command port to initiate action by the 
tape controller: 


{ 
0 - No Read operation 
1 - Perform a Read operation 
j 0 - No Write operation 
11 - Perform a Write operation 
j 0 - No Rewind operation 
I 1 - Perform a Rewind operation 


After the system sends a command 
to the tape controller, 
the system reads a 
status byte from the controller. 
This byte is of the following form: 


If the system has issued a command for a read operation or a write operation, 
the 
appropriate bit (bit 0 for a read operation or bit I for a write operation) 
is sampled. If the 


tape controller is ready to send/receive 
data, the system performs a read or write opera- 


tion from the tape controller's 
data port. After 128 read/write 
operations, 
bit 2 will 


become a I, signalling the completion 
of the operation. 


If the system has issued a command for a rewind operation, 
bit 2 of the status port 


will indicate that the rewind operation has been completed. 


Given the previous description of the characteristics 
of the tape controller, 
input 
parameters 
may be specified as follows: 


1. 
Data path width. Data arrives from the tape controller one byte at a time. Each 
read command 
from the system allows a 128-byte block to be read from the 
tape. 


2. 
Data transfer speed. In this example, the data will be arriving synchronously. 
After the Data Byte Available bit goes high, the data may be input at the max- 
imum CPU rate. 


3. 
Accompanying control information. 
In this example, the tape controller does 
not interrupt 
the system. The system reads the tape controller status port to 
determine 
if data is available. 


In this section of the specification, the following elements 
will be considered: 


The format of the data records that are read from the tape 


The method by which the keys will be sorted 


The format of the data records that are written to the tape. 


Each data record read from the tape will consist of 128 bytes. There are three fields 
of interest in each data record read from the tape: 


1. 
Record number. This is a two-byte field which uniquely identifies the record. 
Record numbers 
may be in the range 000016 
- FFFEI6. 
Record numbers 
FFFF 
16 designates an end-of-file record. 


2. 
Key. This is a ten-byte field. This field may contain data describing the record, 
and does not have to be unique to the particular record. In this example, we 
will assume that these ten bytes represent 
an individual's 
last name. 


3. 
Data. The remaining 
116 bytes in a record contain data. 


01 __ 


11 
__ 


21 __ 


Note that the size of the data record is conveniently 
equal to the size of the block 
that is read from the tape. 


The sorting method 
used will be the diminishing 
increment 
sort, or Shell sort. 


This is a commonly 
used sort algorithm 
which is described 
in detail in Sorting and 
Searching, by D.W. Knuth. 
The collating sequence 
used will be the ASCII collating 
sequence. 
The keys will be sorted in ascending order. 


The basic philosophy 
of the diminishing 
increment 
sort is to sort progressively 


larger sublists using a straight insertion technique 
until the final pass, when the entire 


list is sorted using straight insertion. The advantage of this sort is that as the sublists are 
sorted, the entire list becomes more ordered. Therefore, 
when the entire list is sorted 
during the final pass, fewer exchanges are necessary, and that reduces execution 
time. 


For example, consider this IO-element list: 


10 
13 
8 
14 
4 
19 
11 
6 
13 
7 
so"~_ll 
) I 


And the final pass would sort the entire list. 


The basic algorithm is: 


Given: 
N records. 
In this case, the records are 12 bytes long and consist of a 
record number 
and a key field. 


There are two variables of interest in this algorithm. 


Increment: 
In the diminishing 
increment 
sort, a set of increments 
is chosen that will 


help determine 
the number of elements in a sublist. In this case, the incre- 


ments will be 


We will call the variable that contains N/2, then N/4, then finally 1 (for the 
final pass, which sorts the entire list) the increment. 


For each value of increment, 
i.e., for each sorting pass, this variable counts 
from (N - 
Increment) 
to N. This determines 
the number of sorts that are 
performed 
in each pass. 


The algorithm operates as follows: 


Subsort 
counter: 


1. 
Set Increment 
= N 


Do Steps 2 through 
12 until Increment 
= O. 


2. 
Increment 
= Incrementl2 


Sort each sublist using a straight insertion sort. 


3. 
Subsort counter 
= N - 
Increment 


Do Steps 4 through 
12 until Subsort counter 
= N + 


4. 
Subsort counter 
= Subsort counter + 


5. 
Keytemp = Key (Subsort counter) 


6. 
Recordtemp 
= Record (Subsort counter) 


7. 
Index = Subsort counter 
- 
Increment 


8. 
Compare 
Keytemp with Key (Index) 


If Keytemp ~ Key (Index), 
then go to Step 12 else go to Step 9 


9. 
Record (Index + Increment) 
= Record (Index) 


10. 
Index = Index - 
Increment 


11. 
If Index > 0, then go to Step 8 else go to Step 12 


12. 
Record (Index + Increment) 
= Recordtemp 


Each data record written to the tape consists of 12 bytes. There are two fields of 
interest in each data record written to the tape: 


1. 
Record number. This is a two-byte field which is identical to the record num- 
ber field in the input record format. 


2. 
Key. This is a ten-byte 
field which is identical to the key field in the input 
record format. 


These records are organized as follows: 


01 __ 


'1 __ 


21 __ 


Note that 128, the number of bytes in a tape block, is not a multiple of 12, the number of 
bytes in an output record. Therefore 
some algorithm must be used to pack the output 
records into a tape block. This algorithm is discussed later, in the program design sec- 
tion. 


Given 
the previous 
description 
of the tape controller's 
characteristics, 
output 
parameters 
may be specified as follows: 


1. 
Data path width. Data is sent to the tape controller 
one byte at a time. Each 
write command 
from the system allows a 128-byte block to be written to the 
tape. 


2. 
Data transfer speed. In this example, the data is sent synchronously. 
After the 
Ready for Data Byte bit goes high, data may be sent to the controller 
at the 
maximum 
CPU rate. 


3. 
Accompanying control information. 
In this example, the tape controller does 
not interrupt 
the system to signify that it is ready for data. The system reads 
the tape controller 
status port to determine 
if the tape controller is ready for 
data. 


In this example, the only errors of concern are tape errors. These errors will be 
processed 
by the read/write 
tape subroutine. 
This processing 
will be discussed 
in 
Chapter 6. 


By examining 
the task that the program will perform, 
it appears that there are 
three major functions that will comprise the program: 


Reading records from the tape and extracting the key from each record 


Sorting the keys 


Writing the sorted keys back to the tape. 


None of the above modules is very complex, therefore 
flowcharts will be used to 
describe each of them. 


The module that reads the tape contains only one decision point. As the module is 
reading records from the tape, it examines each record to ascertain whether or not the 
record is an end-of-file record (record number 
= FFFF16). 
If an end-of-file 
record is 
detected, control is passed to the sort module; otherwise the record number and key will 
be extracted from the record and saved in a temporary area where they will be processed 
by the sort module. The next record is then read from the tape. 


Read a record 
from tape into 


buffer area 


Extract record no. 
& key from record. 
Sort 
Save in sort area 


Increment = 


Record number 


Recordtemp = 
Record 
ISubsort counter! 


Index = Subsort 
counter - 
Increment 


Record (Index 
+ Increment) 
= Record (Index) 


Index = 
Index - 
Increment 


Record (Index 
+ Increment) 
= Recordtemp 


Subsort counter 
= N - 
Increment 


Subsort counter 
= Subsort 
counter + 1 


The module which writes the key file to the tape is not as straightforward 
as the 
module which reads from the tape. There are two decision points in this module. The 
first decision has to do with filling a tape block. Since it would not be very efficient in 
terms 
of tape space to write a 128-byte block for each 12-byte record, 
records are 
organized in a buffer until 128 or more bytes have been saved. When 128 bytes have 
been saved, the decision point allows the buffer to be flushed to the tape. The second 
decision point involved 
decrements 
the number 
of records. When all of the output 
records have been moved, an end-of-file record (record number = FFFFI6) 
is appended 
to the buffer and then written to tape. 


Move a record 
to tape 
output buffer 


Write buffer 
to tape 


Adjust extra 


bytes 


The 8086 
Assembly Language 
Instruction Set 


The 8086 is Intel's first 16-bit microprocessor. 
When introduced 
in 1978 it was 
significantly more powerful than any prior microprocessor. 


The 8086 assembly language instruction 
set is upward compatible with 8080A - 
but at the source program level only. That is to say, every 8080A assembly language 
instruction 
can be converted 
into one or more 8086 assembly language instructions. 
There is no reason why anyone would try to convert 8086 assembly language instruc- 
tions, one at a time, into one or more 8080A assembly language instructions, 
but if you 
did, you would soon become hopelessly tangled in conflicting memory allocations and 
special translation 
rules. That is why we say that the 8086 and 8080A assembly language 
instruction 
sets are "upward" 
compatible. 
The 8086 and 8080A assembly language instruction sets are not compatible at the 
object code level, which means that 8080A programs stored in read-only memory are 
useless in an 8086 system. 
The 8085 and 8080A assembly language instruction 
sets are identical, with the 
exception of the 8085 RIM and SIM instructions. 
The 8085 RIM and SIM instructions 
cannot be translated 
into 8086 instructions. 
This is because the RIM and SIM instruc- 
tions use the serial I/O logic of the 8085, which has no 8086 counterpart. 
Without the 
RIM and SIM instructions 
the 8085 and 8080A assembly language instruction 
sets are 
identical; therefore 
the 8086 assembly language instruction 
set must also be upward 
compatible with the 8085 assembly language instruction 
set - 
apart from the RIM and 
SIM instructions. 
The 8085 and 8080A assembly language instruction 
sets are object code compati- 


ble - 
with the exception of the 8085 RIM and SIM instructions. 
That is to say, a pro- 
gram existing in read-only memory could be used with one microprocessor 
or the other. 


The 8080A assembly language instruction 
set is a subset of the Z80 assembly 
language instruction set. That is to say, the Z80 will execute an 8080A object program - 
but the reverse is not true. The 8080A cannot execute Z80 programs when the full Z80 
instruction 
set is used. The 8086 assembly language instruction 
set is not upward com- 
patible with the Z80 assembly language instruction 
set. 


As a historical note, it is worth mentioning 
that the 8008 microprocessor, 
which 
preceded the 8080A, was also compatible only at the source program level. That is to 
say, there is an 8080A assem bly language instruction 
for every 8008 assem bly language 
instruction, 
but the two microprocessor 
object code sets are not the same. 
The various instruction 
set compatibilities 
that we have described may be illus- 
trated as follows: 


(Excluding RIM and 


SIM instructions) 


I 
______ 
...J 


-- 
Source program of lower microprocessor 
can be assembled to generate upper 
microprocessor object program 


- 
- 
- 
Lower microprocessor instruction set is a 
subset of upper microprocessor 
instruction 
set at the object program level 


These are the most interesting innovations 
to be found in 8086 hardware design: 


1. 
8086 Central Processing Unit logic has been divided into an Execution 
Unit 
(EU) 
and 
a 
Bus 
Interface 
Unit 
(BIU). 
These 
two 
halves 
operate 
asynchronously. 
The Bus Interface Unit handles all interfaces with the exter- 
nal bus; it generates 
external 
memory 
and I/O addresses 
and has a 6-byte 
instruction 
object code queue. Whenever 
the EU needs to access memory or 
an I/O device, it makes a bus access request to the Bus Interface Unit. Provid- 
ing the Bus Interface 
Unit is not currently 
busy, it acknowledges 
the bus 
access request from the EU. When the Bus Interface Unit has no active pend- 
ing bus access requests from the EU, it performs instruction 
fetch machine 
cycles to fill the 6-byte instruction 
object code queue. 
The CPU takes its 
instruction 
object codes from the front of the queue. Thus instruction 
fetch 
time is largely eliminated. 


2. 
The 8086 has been designed to work in a wide range of microcomputer 
system 
configurations, 
ranging from a simple one-CPU 
system to a multiple-CPU 


network. To support this wide flexibility, a number of 8086 pins output alter- 
nate signals. This may be illustrated as follows: 


These signals 
do not change 


Simple control output 
for use in one-CPU system 


The same pins output these two sets of signals, based on a level of MN/MX. 
This wholesale reallocation of signals was a highly imaginative and innovative 
first for the microprocessor 
industry. 
3: 
The 8086 has built-in logic to handle bus access priorities in multi-CPU 
con- 
figurations. 
(This is not a new concept; National Semiconductor's 
SC/MP 
has 


had it for years.) 


4. 
In multi-CPU configurations, 
each 8086 CPU can have its own local memory, 
while simultaneously 
sharing common memory. The common memory may 


be shared by all CPUs, or by selected CPUs. 


5. 
The 8086 has been designed 
to compete 
effectively in program 
intensive 
applications that have been the domain of the minicomputer. 
Up to a million 
bytes of external memory can be addressed directly. All memory addressing is 
base relative; this memory addressing technique 
naturally generates relocata- 


ble object programs. 
(Relocatable 
object programs can be moved from one 


memory 
address 
space to another 
and re-executed 
without 
modification.) 
Also, since the 8086 utilizes stack-relative 
addressing, 
re-entrant 
programs 
are easily written. (Re-entrant 
programs can be interrupted 
in mid-execution 
and re-executed. 
For example, a subroutine 
which calls itself is re-entrant; 
a 


program which can be interrupted 
in mid-execution 
by an external interrupt, 
and then re-executed 
within the interrupt service routine, is also re-entrant.) 


6. 
The 8086 uses prefix instructions 
that modify the interpretation 
of the next 
instruction's 
object code. 


The 8086, like its predecessor, 
the 8080A, is really one component 
of a multiple- 


chip microprocessor 
configuration. 
In addition 
to the 8086 microprocessor 
itself, you must 
have an 8284 Clock 
Generator/Driver. 
You could create the required clock signal using alternative logic, but 
it would be neither practical nor economical to do so. 
The third device necessary 
in some 8086 microprocessor 
configurations 
is the 
8288 Bus Controller. 


You will usually have an 8288 Bus Controller between an 8086 and its system bus 
(or busses), just as you will usually have an 8228 System Bus Controller 
between an 
8080A and its system bus. In the case of the 8086, however, you can dispense with the 
8288 Bus Controller 
in single-bus configurations 
- 
and pay no penalty for it. 


Chapters 6, 7, 8, and 9 discuss basic 8086 hardware, 
single-CPU configurations, 


the Multibus· 
and multi-CPU 
configurations. 


Next we will specify a program module which interfaces a system to a serial input/ 
output channel. We will also look at design tasks associated with creating this program 
module. 


The following is a general block diagram for a serial input/output 
channel: 


Data 
" 
I 


L 


Status 
Serial Line In 
510 Device 
Intel 
Terminal 
System 
8251A 
(CRTI 


Control 
•.. 
Keyboard) 


..• 
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Modem 


In this example, the serial input/output 
(SIO) channel is an Intel 8251A Program- 
mable Communication 
Interface. It is assumed that the 8251A is connected to a com- 
munications 
terminal; the communications 
terminal has a CRT on which it displays data 
transmitted 
by the system. Data from a keyboard is transmitted 
by the channel to the 
system. Data is not buffered at the terminal on input or output. Data is transmitted 
and 
received asynchronously. 


The program module, which will also be referred to as the driver, connects operat- 
ing system software with the 8251A. This may be illustrated as follows: 


I 
I 
I 
I 
I 
I 
l- 
---J 


Commands 
Data 


Operating 
Software 
I Status 


System 
Driver 
8251A 


Software 
Data 
o. 
rControl 
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Operating system software sends commands 
and/or 
data to the 1/0 driver pro- 
gram module; this can be handled in a variety of ways. They include: 


1. 
Placing the command or data in a register. For example, one register could be 
assigned to holding commands, 
while data passes through another 
register. 


2. 
Using a task block. The task block could contain the command and data or the 
command and a pointer to the data. The task block could be located at a fixed 
memory location, or it could be pointed to by one of the registers. 


3. 
Via the stack. System software could push the equivalent of a task block (i.e., 
commands 
and data/pointers) 
onto the stack. 


Selecting one of the above techniques 
is usually a processor-dependent 
decision. 


Since the present discussion 
is not processor-dependent, 
the rationale for selecting a 
parameter 
passing technique 
will be deferred to a later chapter. 


The SIO device used in this example is an Intel 8251A. This device requires the 
following input specifications: 


1. 
Data path width. The 8251A allows 5-, 6-, 7-, or 8-bit characters. 
In this 
example, 
an 8-bit data path is required 
by the device for commands 
and 
status. To allow for a future system using a different size data path, the pro- 
gram design will allow the size of the data path to be specified. 


2. 
Data 
transfer 
speed. 
In this 
example, 
the 
data 
will be 
transferred 
asynchronously. 
Only the maximum data transfer rate can be specified. In this 
example, 9600 baud is specified as the maximum 
data transfer rate. 


3. 
Handshaking 
protocol. In this example the SIO channel does not interrupt the 
microprocessor, 
rather the microprocessor 
polls the channel to determine 
if 
data is available. 


Next the 1/0 driver must consider actual 1/0 channel operations. 
In addition to 
data transfer, 
controls must be transmitted 
to the 1/0 channel, 
in this case an 8251A, 


and status must be received from it. 


Data is known to be available at this I/O port when an appropriate status bit has 
been set to 1. In the case of the 8251, the RxRDY 
(Receiver Ready) bit of the status 
port must be set to 1. 


The 8251A is initialized to a known state by writing information 
to the control 
port. At least two bytes of control information 
are necessary to initialize the 8251 A. 
Control information 
is sent in the following sequence: 


1. 
Mode Select Byte 


2. 
Sync Character 
(Synchronous 
mode only) 
3. 
Sync Character 
(Synchronous 
mode only) 
4. 
Command 
Select Byte 


In this example, the 8251 A will be operating in the asynchronous 
mode, therefore 
a two-byte initialization sequence is needed; it is: 


1. 
Mode Select Byte 


2. 
Command 
Select Byte 


152151 
I EP IPENI L2I 
L1 I B21 B11 


• t 


Baud Rate Factor 


0 
1 
0 
1 


0 
0 
1 
1 


Sync 
(1 x) 
(16x) 
(64x) 
Mode 


Character Length 


0 
1 
0 
1 


0 
0 
1 
1 


5 
6 
7 
8 
Bits 
Bits 
Bits 
Bits 


Parity Enable 


1 - Enable 
0- 
Disable 


Even Parity Generation/Che 


1 - Even 
0- 
Odd 


Number of Stop Bits 


0 
1 
0 
1 


0 
0 
1 
1 


Invalid 
1 
lY2 
2 
Bit 
Bits 
Bits 


(Only affects 
Tx; Rx never 


requires more than one stop bit) 


{ 


Transmit 
Enable 
1 = enable 
o = disable 


Data Terminal Ready 


"high" 
will force DTR 
output to zero 


{ 


Receive 
Enable 
1 = enable 
o = disable 


Send Break Character 
1 = forces TxD "low" 
o = normal operation 


Error Reset 
1 = reset error flags 
PE, OE, FE 


Request to Send 
"high" 
will force RTS 
output to zero 


{ 


Internal 
Reset 


"high" 
returns 8251 A to 
M ode Instruction 
Format 


{ 


Enter 
Hunt Mode 
1 = enable search for Sync 
Characters 
(Has no effect in Async model 


7 
6 
5 
4 
3 
2 
1 
0 
~Bit 
No. 
= 
-- 
----l:={These 
two bits are hardware dependent. 


In this system, a x 16 clock will 
be used 


These two bits specify the number of data 


{bits 
per character. In this case, 8 bits of data are 


exchanged 


This bit disables parity 


Given bit 4, this is a "don't 
care" bit 


{These 
two bits are hardware dependent. 


In this system, 
1Y, stop bits will 
be used 


L 


Enable the transmitter 


Turn on Data Terminal 
Ready 


Enable the receiver 


Do not send a break character 


Reset the error flags 


Turn on Request to Send 


Do not return to the mode instruction 
format 


Since asynchronous 
operation 
is specified, 


this is a "don't 
care" 
bit 


Additional features of 8251 A programming are discussed during the implementa- 
tion section in Chapter 
6, 
The status port supplies information 
on the state of the 8251, and the state of the 
device to which it is connected. 


L 


When a byte is read from the status port, the following information 
is transferred 
to the system. 


I DSR I~YE~-I 
FE 
I 
DE I PE 
~~~TyIRXRDyITXRDyl 


{ 


{ 


{ 


{ 
I 


i 


i 


~ 


This bit indicates to the system that the 8251 A 
transmitter is ready for a data character. If this 
bit is a 1, data may be sent to the 8251 A. If this 
bit is a 0, data may not be sent to the 8251 A. 


This bit 
indicates 
that 
the 8251A 
receiver 


buffer contains a data character that is ready 
for transfer to the system. 


This 
bit 
indicates 
that 
the 
8251A 
has no 


characters 
to transmit. 
If this bit is a 1, the 


transmitter 
is empty. 
If this bit 
is a 0, the 


transmitter 
is sending data. 


Parity Error. The PE flag is set when a parity 
error is detected. It is reset by the ER bit of the 
Command Instruction. PEdoes not inhibit opera- 
tion of the 8251 A 


Overrun Error. The OEflag is set when the CPU 
does not read a character before the next one 
becomes available. It is reset by the ERbit of the 
Command 
Instruction. 
OE does 
not 
inhibit 


operation of the 8251 A; however, the previous 
overrun character is lost. 


Framing Error (Async only). The FE flag is set 
when a valid Stop bit is not detected 
at the 


end of every character. It is reset by the ER 
bit of the Command Instruction. 
FE does not 


nhibit the operation of the 8251A. 


This bit 
indicates 
whether 
the 
8251A 
has 


detected 
a 
SYNC 
character. 
This 
bit 
is 


meaningful 
only 
if 
the 
8251 A 
has 
been 


nitialized 
to 
operate 
in 
the 
synchronous 


mode. If this bit is a 1, a SYNC character has 
been detected. 
If this 
bit 
is a 0. no SYNC 


character has been detected. 


This bit reflects 
the state of the DSR pin on 


the 
8251 A. If this 
bit 
is 
1, DSR is high. 


{ 


theref~ 
the data set is not ready. If this bit 


is 0, DSR is low, 
indicating 
data set ready. 


This is a modem control line. 


What sort of functions should the driver provide? Will data be translated on input 
and/or 
output? 
These are the functions a real I/O driver will provide: 


1. 
Initialize 
the channel. 
When 
power is applied to the system, 
the 8251 A 
powers up in an unknown 
state. The I/O driver will put the channel 
into a 
known state. 


2. 
Input a single character. When this function is requested, 
the driver reads the 
status port and waits until data is available. When data is available, the driver 
reads the data port and passes the information 
back to the system. 


3. 
Output a single character. When this function is requested, 
the system must 
pass the character to be output, 
or a pointer to that character, 
to the driver. 


The driver reads the status port and waits until the transmitter 
is available. 
When 
the 
transmitter 
is available, 
the driver 
will transfer 
the 
specified 
character to the data port. 


4. 
Check the channel's 
status. 
Perhaps 
the system does not need to read a 
character, 
rather it needs to know if a character is available. Under such cir- 
cumstances 
the system will read the status port contents. 


5. 
Send control information 
to the channel. The system may need to alter the 
state of the channel, 
for example, 
to allow the channel to check for parity 
errors. 


6. 
Input a series of characters from the channel. You may wish to input charac- 
ters until some terminating 
condition 
is detected. 
For example, 
a carriage 
return may constitute 
a terminating 
condition, 
or a fixed number of charac- 
ters may have to be input. Five numeric characters constitute a ZIP Code, for 
example. The I/O driver will read data from the channel. This involves wait- 
ing for data to be available, then reading the information 
present at the data 
port saving tne data in some designated 
place in memory, 
then testing to 
determine 
if the terminating 
condition has been reached. 


7. 
Output a series of characters to the channel. The system may wish to output a 
series of characters until a terminating 
condition is detected. Possible termina- 
tion conditions might include either the detection of a predetermined 
end-of- 
string character 
or the output 
of a specific number 
of characters. 
The I/O 
driver will test for the termination 
condition; 
if the terminating 
condition 
is 
not detected, 
the I/O driver will load data from a specified memory location, 
and send the data to the channel. 


The 8251A uses the control information 
to define the channel's 
output charac- 
teristics. These output specifications need to be defined: 


1. 
Data path width. The 8251A allows data units to consist of either 5, 6, 7, or 8 
bits. In this system, 8 data bits will be transmitted. 


2. 
Data transfer speed. In this case, the maximum 
data rate will be 9600 baud. 


3. 
Handshaking 
protocol. 
In this example, 
the 8251A will not interrupt 
the 
system, rather the system reads the 825 I Status register to determine 
whether 
the 8251A is ready to transmit 
more data. 


We have already described the Data and Control/Status 
ports of the 8251A. Data 
may be sent to the channel 
when the TxRDY 
bit (Transmitter 
Ready) of the Status 
register is 1. 


In this section, none of the specific modules will be complex. Given this fact, we 
will use flowcharts to describe each module of the I/O driver. 


The Initialization 
routine contains only one major decision point. If the standard 
initialization is requested, 
a pointer to a standard initialization sequence will identify the 
information 
which must be sent to the control port. As an alternative, 
a "custom" 
initialization 
sequence 
may be executed; 
in this case the user will have to provide the 
initialization 
sequence and a pointer to it. 


Standard Initialization 
or User Specified 


Output 
Control Byte 


The single-character 
input routine flowchart is illustrated 
below; it calls a read 
channel status routine, waits for data to become available, reads the data from the data 
port, and returns. 


Two considerations 
not 
included 
in the design 
of the single-character 
input 
routine are: 


• 
Handling of 8251 errors. When the read channel status routine is called, the 
error bits within the 8251 Status register can be examined. 
If an 8251 error is 
detected, an appropriate error code is returned 
to the I/O driver by the single- 
character input routine. 


• 
Timeout errors. The driver initializes an appropriate register/memory 
location 
to serve as a timeout clock, then decrements 
the contents of this location each 
time the read channel status routine is called. If the contents 
of the timeout 
register/memory 
location decrement 
to zero, a timeout error code is returned 
to the calling routine. 


If these 
considerations 
are added 
to the 
single-character 
input 
routine, 
the 
flowchart must be modified as follows: 


The 
single-character 
output 
routine 
flowchart 
is illustrated 
below; 
it calls a read 


channel 
status 
routine, 
waits for the transmitter 
to be available, 
writes 
a character 
to the 
data 
port, 
then 
returns. 


As with 
the 
input 
routine, 
error 
and 
timeout 
considerations 
are not included 
in 


the initial design, 
as illustrated 
above. 
In the case of the 8251 A, there 
are no error 
condi- 


tions 
to check 
for since 
the 8251 A reports 
no transmission 
errors 
in its Status 
register. 


But a timeout 
check 
could 
be included 
and 
would 
modify 
the 
program 
flowchart 
as 
follows: 


Initialize 
Channel Timeout 
Value 


The read channel status routine and the send control information 
routine each 
require a very simple flowchart. They may be illustrated as follows: 


A multiple-character 
input routine utilizes the single-character 
input routine to 
read data. On return from the single-character 
input routine, a check for errors is made. 


If the single-character input routine detects an error, this error is passed back to the call- 
ing routine. After saving the character in a location specified by the calling program, the 
multi-character 
input routine looks for a termination 
condition. If the routine has either 
read the number of characters specified by the calling program or has read a termination 
character, 
the multiple-character 
input routine will return to the calling program. 


Input a 
Single 
Character 


The multiple-character 
output routine loads a character from a user specified loca- 
tion. If the character 
is the termination 
character, 
the multi-character 
output 
routine 
returns to the calling program. Otherwise, 
the character is sent to the single-character 
output routine. When the single-character 
output routine returns, a check for timeout is 
made. If a timeout was detected, 
it is passed back to the calling routine. The multiple- 
character 
output 
routine 
then checks to see if it has output 
the specified number 
of 
characters; 
if it has, it returns to the calling program. 


The 8086 instruction 
set has a complexity that is typical for the new generation 
of 
16-bit microprocessors. 
The 8086 instruction 
set consists 
of approximately 
70 basic 
instructions, 
with up to 30 addressing 
modes available for memory reference 
instruc- 
tions. 


Any description 
of a CPU's 
instruction 
set should include these basic types of 


information: 


1. 
What is the CPU configuration; 
i.e., what registers and statuses are available? 


What is the primary use for each register? 


2. 
What instructions 
are available? Obviously, there must be some comprehen- 
sive listing of the instruction set with an associated discussion of each instruc- 
tion's function. This listing may be organized in anyone 
of a number of ways. 


In this chapter, 
we list the 
instructions 
alphabetically 
to help you find 
individual instructions. 
In the next chapter we list instructi.ons according to 


function 
(e.g., all the arithmetic 
operations 
are discussed 
in one section) 


which allows you to examine instructions 
by type or group. 


3. 
What data types does the CPU handle? A simple CPU may require all data to 
be handled in one form, perhaps as bytes. A more flexible CPU may give you 
the option of addressing data as individual bits, bytes, 16-bit words, and 32-bit 
long words. 


4. 
What operand source and destination 
addressing options are allowed? A sim- 
ple microprocessor 
may allow memory to be addressed 
only by instructions 


that move data between memory and CPU registers, while all operations 
on 
data 
require 
operands 
to 
reside 
in 
CPU 
registers. 
A 
more 
complex 


microprocessor 
may allow one operand to be fetched from memory while the 


other operand resides in a CPU register. In some cases, the CPU may allow 
both operands 
to reside in memory. 
Available memory 
addressing 
options 
must be evaluated 
when determining 
the significance of memory operands. 


5. 
What addressing modes are available for which instructions? 
Knowing which 
addressing modes are available for a given instruction 
is a key to the effective 


utilization of the instruction set. However, should we attempt to describe each 
possible addressing mode for each instruction, 
this book would only be availa- 


ble in IS volume sets. Therefore, 
a section on Addressing Modes precedes the 


listing of the instruction 
set. 


6. 
How do various groups of instructions 
affect the CPU's status register? To 
evaluate any sort of conditional 
expression 
in assembly language, you must 


know how to translate the conditional into assembly language. Knowing how 
instructions 
affect status 
flags allows a programmer 
to write conditional 
expressions 
in assembly language. 


7. 
In certain cases, other information 
may be important; 
e.g., the number 
of 
cycles that a particular instruction 
takes to execute or the number of program 
memory 
bytes 
the 
instruction 
occupies. 
In this case, 
each 
instruction's 
description will specify the number of cycles required for execution. The num- 
ber of bytes each instruction 
requires, however, is in some cases very depen- 
dent on the addressing mode specified. 


For our discussion 
of the 8086 instruction 
set, we will proceed in the following 
order: 


1. 
Discussion of the 8086 registers and the 8086 Status register, and discussion 
of how status is affected by various groups of instructions. 
The Status register 
is also referred to as the Flags register or the Program Status Word. 


2. 
Discussion 
of the 8086 addressing modes. 


3. 
Discussion of each 8086 instruction. 
This section will be preceded by a sum- 
mary of the symbols and terms used to describe each instruction. 


The 8086 has four 16-bit general purpose registers, two 16-bit Pointer registers, 


two 16-bit Index registers, one 16-bit program counter, 
four 16-bit Segment registers 


and one 16-bit Flags register. These registers may be illustrated as follows: 


o ...--One 
16-bit register 
o ...-- 
Two 8-bit registers 


A 
AX (= AH. AU Primary Accumulatods) 


H. L 
8X (= 8H. BU Accumulatods) and Base register 


B. C 
CX (= CH. CU Accumulatods) and Counter 


D. E 
DX (= DH. DU Accumulator(s) and I/O Address 


"-v-1 ~tL'"~, 
~m~ 
""'" 
" 'S·"" ro,""" 


----~--~----These 
names apply to 8-bit registers 


o ",,--Bit 
No. 


SPI~ 
IStack Pointer (SPI 


Base Pointer (BP) 
-------~ 
15 
0 ",,--Bit 
No. 
Index registers 


1 
=============1 
Source Index (SIl 
Destination Index (DIl 
-------~ 
15 
0 ....--Bit 
No. 


PCI~_-_-_-_-_-_-_-_-_-_-_-_-_-_-.•I Program Counter (PCI 


15 
0 ....--Bit 
No. 


}s.,~,'""",,, 


Code Segment (CSI 
I-------~ 
Data Segment (DS) 
I-------~ 
Stack Segment (SS) 


15 
0 ~BitNo. 


I 
IStatus 


D 
8086 equivalents for 8080A registers 


The general purpose registers may be referenced 
as two separate 8-bit registers. 
This may be illustrated as follows: 


o .-AX 
bit numbers 
o ,-AH. 
ALbit numbers 


o 4--BX 
bit numbers 
o 4--BH, 
BLbit numbers 
I 
BH I 
BL I 
~ 
BX 


o 4--CX 
bit numbers 
o 4--CH. 
CLbit numbers 
I 
CH I 
CL I 
~ 
CX 


o 4--0X 
bit numbers 
o 4--0H, 
OL bit numbers 
I 
OH 
I 
OL 
I 
~ 
ox 


This is an advantage in that instead of performing a 16-bit operation on 8-bit quantities, 
which may take more time and memory space than an 8-bit operation, an 8-bit operation 
may be performed. 
For example, if you want to initialize a register with 20010 and then 
decrement 
it to 0 based on subsequent 
events, an 8-bit register will certainly suffice. 


The general purpose 
registers can be used as operands 
in all the 8- or 16-bit 
arithmetic/logical 
operands. 


The AX register serves as the primary accumulator. 
This register has two unique 
characteristics. 
All 1/0 operations 
are performed 
through 
this register, and operations 
utilizing immediate 
data typically require less memory space when performed 
on this 
register. In addition, some string operations and arithmetic 
instructions 
require use of 
this register. 


The AL register generally corresponds 
to the 8080 A register. 


The BX register is referred to as the Base register. This is the only general purpose 
register 
which is used in the calculation 
of 8086 memory 
addresses. 
All memory 
references which use this register in the calculation of the memory address use the DS 
register as the default segment 
register. The BX register generally corresponds 
to the 
8080A HL register; 
the BH register corresponds 
to the 8080 H register and the BL 
register corresponds 
to the 8080 L register. 


The CX register is referred to as the Count register. This register is decremented 
by string and loop operations. 
CX is typically used to control the number of iterations a 
loop will perform. It is also used for multiple bit shifts and rotates. This register generally 
corresponds 
to the 8080 BC register. 


The CH register corresponds 
to the 8080 B register. The CL register corresponds 
to the 8080 C register. 


The DX register is referred to as the Data register, mostly for mnemonic 
reasons. 


This register provides the I/O address for some I/O instructions, 
a function no other 
8086 register performs. This register generally corresponds 
to the 8080A DE register. 


The DH register corresponds 
to the 8080 D register. The DL register corresponds 
to the 8080 E register. The D register is also used for arithmetic 
operations, 
including 
multiplication 
and division. 


The Pointer registers are used to access data in the stack segment. They may be 
used as operands in all 16-bit arithmetic/logical 
operations. 
The SP register, referred to as the stack pointer, allows the implementation 
of a 
stack in memory. All references to the SP for memory addressing use the SS register as 
the Segment register. This register generally corresponds 
to the 8080 SP register. 


The BP register, referred to as the base pointer, allows data to be accessed in the 
stack segment. 
Typically, this register is used to reference 
parameters 
that have been 
passed via the stack. 


The Index registers are used to access data in data memory. The Index registers 
are used extensively 
by the string operations. 
They may be used as operands in all the 
16-bit arithmetic/logical 
operations. 


The Segment registers are included in all memory addressing calculations. 
Each 
Segment register defines a 64K block of memory in the 8086 memory addressing space, 
which is referred 
to as the Segment register's 
current 
segment; 
e.g., the DS register 
defines a 64K segment 
referred to as the current data segment. 


The CS register is also known as the Code Segment register. During each instruc- 
tion fetch, the program counter contents are added to the CS register contents in order 
to compute the memory address for the instruction 
to be fetched. 
The DS register is also known as the Data Segment register. Every data memory 
reference is taken relative to the Data Segment register, with three exceptions: 


I. 
Stack addresses are computed 
using the stack pointer. 


2. 
Data memory addresses computed 
using the BP register are taken relative to 
the stack segment. 


3. 
String operations 
(which use the or register in the address calculation) 
are 
taken relative to the extra segment. 


The SS register 
is also called the Stack Segment 
register. 
All data memory 
references that use the SP or BP register in the address calculation are taken relative to 
the SS register. Therefore 
all stack-oriented 
instructions 
(e.g., PUSH, POP, CALL, 
RET, and INT) use the SS register as the Segment register. 


The ES register is also referred to as the Extra Segment register. String operations 
compute 
memory 
addresses 
using the DI register that are taken relative to the ES 
register. 


The use of segment 
registers is typically implied by the instruction, 
however 
a 
mechanism 
will be discussed later which allows the implied Segment register to be over- 
ridden in most circumstances. 


The 8086 has one I6-bit Flags register, also referred to as a Status register or Pro- 
gram Status Word. This register may be illustrated as follows: 


Reserved bits. normally 0 


Carry 


Parity 


Auxiliary Carry 


Zero 


Sign 


Trap 


Interrupt enable/disable 


Direction 


Overflow 


The Carry, Auxiliary Carry, Overflow, and Sign statuses are quite standard. 
The Carry status reflects carries out of the high-order 
bit following arithmetic 
operations. 
Carry is also modified by certain shift and rotate instructions. 


The Overflow status is the Exclusive-OR 
of carries out of the high-order 
and 
penultimate 
bits following arithmetic 
operations. 
It implies a magnitude 
overflow in 
signed binary arithmetic. 


The Sign status equals the high-order 
bit following an arithmetic 
operation. 
On 
the assumption 
that signed binary arithmetic 
is being performed, 
a Sign status of 0 
specifies a positive result, whereas a Sign status of 1 specifies a negative result. 


The Auxiliary Carry status is identical to the 8080A status with the same name. It 
represents 
carries out of bit 3 in an 8-bit data unit. 


Subtract 
instructions 
use twos complement 
arithmetic 
in order to subtract 
the 
minuend 
from the subtrahend. 
However, the Carry status is inverted. That is to say, 


following a subtract operation, the Carry status is set to 1 if there was no carry out of the 
high-order bit, and the Carry status is reset to 0 if there was a carry out of the high-order 
bit. The Carry status therefore 
indicates a borrow. 


The Parity status is set to 1 when the low-order eight bits of any data operation 
result has an even number of 1 bits. An odd number of 1 bits causes the Parity status to 
be reset to O. 


The Zero status is set to 1 when the result of a data operation is zero; it is set to 0 
when the result of a data operation is not zero. 


The Direction status determines 
whether string operations will auto-increment 
or 
auto-decrement 
the contents of Index registers. If the Direction status is 1, then the SI 
and 01 Index registers' 
contents 
will be decremented; 
that is to say, strings will be 


accessed from the highest memory address down to the lowest memory address. If the 
Direction status is 0, then the SI and Dllndex 
registers' 
contents will be incremented; 


that is to say, strings will be accessed beginning with the lowest memory address. 


The Interrupt status is a master interrupt enable/disable. 
This status must be 1 in 


order to enable interrupts 
within the 8086. If this status is 0, then all interrupts 
will be 


disabled. 


The Trap status is a special debugging aid that puts the 8086 into a "single step" 


mode. The single step mode is described in detail together 
with 8086 interrupt 
logic, 


since it depends on this interrupt 
logic for its existence. 
The Carry, Auxiliary Carry, Parity, Sign, and Zero statuses are also found in the 


8080A. The Overflow, Direction, 
Interrupt, 
and Trap statuses are new in the 8086. 


The list below identifies tables that describe individual instructions 
and how they 
affect the Flags register. For example, to determine 
how the ADD instruction affects the 
flags, consult Table 3-2. 


Instruction 
Mnemonic 
Table 
Instruction 
Mnemonic 
Table 


AAA 
3-4 
LODS 
3-1 
AAD 
3-10 
LOOP instructions 
3-1 


AAM 
3-10 
MOV 
3-1 
AAS 
3-4 
MOVS 
3-1 
ADC 
3-2 
MUL 
3-6 
ADD 
3-2 
NEG 
3-2 
AND 
3-1 
NOT 
3-1 
CALL 
3-1 
OR 
3-7 
CBW 
3-1 
OUT 
3-1 


CLC 
3-9 
POP 
3-1 
CLD 
3-9 
POPF 
3-12 
CLI 
3-9 
PUSH 
3-1 
CMC 
3-9 
PUSHF 
3-1 
CMP 
3-2 
RCL 
3-8 
CMPS 
3-2 
RCR 
3-8 
CWO 
3-1 
REP 
3-1 


DAA 
3-5 
RET 
3-1 
DAS 
3-5 
ROR 
3-8 
DEC 
3-3 
SAHF 
3-9 
DIV 
3-11 
SAR 
3-7 
ESC 
3-1 
S88 
3-2 
HLT 
3-1 
SCAS 
3-2 


IDIV 
3-11 
SHL 
3-7 


IMUL 
3-6 
SHR 
3-7 


IN 
3-1 
STC 
3-9 
INC 
3-3 
STD 
3-9 
INT 
3-13 
STI 
3-9 
INTO 
3-13 
STOS 
3-1 
IRET 
3-12 
SUB 
3-2 
Jump-on-Conditions 
3-1 
TEST 
3-7 
JCXZ 
3-1 
WAIT 
3-1 
JMP 
3-1 
XCHG 
3-1 


LAHF 
3-1 
XLAT 
3-1 
LOS 
3-1 
XOR 
3-7 


LEA 
3-1 


LES 
3-1 


LOCK 
3-1 


CAll 
CBW 
CWO 
ESC 
HlT 
IN 
Jump-on-Conditions 
JCXZ 
JMP 
lAHF 
lDS 
lEA 
lES 
lOCK 
laDS 


lOOP 
instructions 
MOV 
MOVS 
NOT 
OUT 
POP 
PUSH 
PUSHF 
REP 
RET 
STOS 
WAIT 
XCHG 
XlAT 


The instructions 
in Table 3-2 affect all six of the 8086 arithmetic flags: Overflow, 


Carry, Arithmetic, 
Zero, Sign, and Parity. 


ADC 
ADD 
CMP 
CMPS 


NEG 
SBB 
SCAS 
SUB 


The instructions 
in Table 3-3 affect all the 8086 arithmetic flags except for Carry. 
Overflow, Arithmetic, 
Zero, and Parity are all affected. 


Table 3-3. 
Instructions 
that 
Affect 
all 8086 Arithmetic 
Flags 


except 
Carry 


The instructions 
in Table 3-4 affect all the 8086 arithmetic 
flags. However, 
only 
the values for AF and CF are meaningful. 
The values for Overflow, Zero, Parity, and 
Sign are unknown. 


The instructions 
in Table 3-5 affect all the 8086 arithmetic 
flags. However, 
the 
Overflow 
flag is not meaningful. 
Carry, 
Arithmetic, 
Zero, 
Sign, and Parity are all 
meaningful. 


The instructions 
in Table 3-6 affect all the 8086 arithmetic 
flags. The Carry and 
Overflow flags are not affected in the normal manner. 
Consult 
the instructions 
for a 
description 
of how these flags are set. All other arithmetic 
flags are undefined. 


Table 3-6. 
Instructions 
that Affect All Arithmetic 
Flags, Leaving 


CF and OF Meaningful 


The instructions 
in Table 3-7 affect all the 8086 arithmetic flags. Carry and Over- 
flow are cleared to O. AF is undefined. Zero, Parity and Sign are set in the normal man- 
ner. 


Table 3-7. 
Instructions 
that Affect All Arithmetic 
Flags, 


Leaving AF undefined 


AND 
OR 
SAR 
SHL 


SHR 
TEST 
XOR 


The instructions 
in Table 3-8 affect only the Carry and Overflow 
flags. The 
Arithmetic, 
Zero, Sign, and Parity flags are not altered. 


Table 3-8. 
Instructions 
that Affect Carry and Overflow 
Flags Only 


The instructions 
in Table 3-9 are used to affect specific flags. For example, STI is 
used to set the Interrupt 
flag to 1. 


CLC - Clear Carry 
CLD - Clear Direction 
CLI - Clear Interrupt 
CM C - Complement 
Carry 


SAHF - Move 
AH to 8080 
flags 


STC - Set Carry 
STD - Set Direction 
STI - Set Interrupt 


The 
instructions 
in Table 
3-10 affect 
the Parity, 
Sign, and Zero 
flags. The 
Carry, 
Overflow, 
and Arithmetic 
flags are undefined 
following 
execution 
of these 
instructions. 


Table 3-10. Instructions 
that Affect Parity, Sign and 


Zero Flags 


I 
II~_DIV 
IDIV_ 


Table 3-12. 
Instructions 
that Restore All the 8086 Flags from the 
Stack 


The 
instructions 
in Table 
3-13 
clear 
the 
Interrupt 
and 
Trap 
flags. 
The 
INTO 
instruction 
only affects 
these 
flags if the Overflow 
flag is I. 


There are two major topics of interest concerning 
8086 addressing modes: 


1. 
How the memory address is formed. 


2. 
What addressing modes are available. 


All 8086 memory addresses are computed by summing the contents of a Segment 


register and an effective memory address. The effective memory address is computed 
via a variety of addressing 
modes, as it would be for any other microprocessor. 
The 


selected Segment register contents are left-shifted four bits, then added to the effective 
memory address to generate the actual address output as follows: 


Segment Register contents: 
X X X X X X X X X X X X X X X X 0 000 
Effective memory address: 
+ 0000 
Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y 


Actual address output: 
X X X Z Z Z Z Z Z Z Z Z Z Z Z Z Y Y Y Y 


Thus a 20-bit memory address is computed 
- 
which allows 1,048,576 bytes of 


external memory to be addressed directly. 
An 8086 address is therefore 
composed of two distinct addresses: 
the Segment 


register contents, 
referred to as the segment address, and the effective memory address, 


referred to as the offset address. 


The segment registers of the 8086 are unlike any other microprocessor 
registers. 


They act as base registers which can point to any memory location that lies on an address 
boundary that is an even multiple of 16 bytes. Using arbitrary memory addresses, 
this 


may be illustrated as follows: 


Memory 
Address 


CS Segment register 


contains 234E 16 


ES Segment register 


contains OA321 6 


OS Segment register 


contains 021 F16 


1A31 F16 
- 
- - - 
- 
} I 


121 EF16 
- 
} 
_ 
~ 
c: 
tfl 
Q) 
w 


OA32016-- !- 


021 F016 
- 


As illustrated above, each segment register identifies the beginning of a 65,536- 


byte memory segment. Since the 8086 has four segment registers, there will at any time 
be four selected 65,536-byte 
memory segments. 
The actual address output will always 
select a memory location within one of these four segments. 
For example, if an actual 


address output is the sum of the DS Segment register and an effective memory address, 
then the actual address output must select a memory location within the DS segment; 
that is to say, within the address range 021FOl6 through 
121EFI6 
in the illustration 
above. Likewise, an actual address output which is the sum of the CS Segment register 
and an effective memory address must select a memory location within the CS segment, 
which in the illustration above will lie in the address range 234EOl6 through 334DF 16' 


No restrictions 
are placed on the contents 
of segment 
registers. Therefore 
8086 
memory is not divided into 65,536-byte pages, nor do the four segment registers have to 
specify non-overlapping 
memory spaces. Each segment register identifies the origin of a 
65,536-byte memory segment which may lie anywhere within addressable memory, and 
mayor 
may not overlap with one or more other segments. 
8086 addressing modes can be divided into two distinct types: 


1. 
Program memory addressing modes. 


2. 
Data memory addressing modes. 


We will discuss each of these topics, then at the end of this section show how they 
are implemented 
on the 8086. 


Whenever 
an instruction 
fetch is performed, 
the address of the memory location 
from which the instruction 
is fetched is computed as the sum of an offset taken from the 
program counter (also called the PC register) and a segment taken from the CS register. 
Normally, 
the PC register 
contents 
are incremented 
as instructions 
are executed. 


However, 
Jump and Call instructions 
may modify the PC register contents 
in one of 
three ways: 


1. 
Program relative addressing. An 8-bit or 16-bit displacement 
provided by the 
instruction 
in the form of immediate 
data is added to the PC register as a 
signed binary number. This does not alter the CS register contents. Therefore 
it is termed an intrasegment 
operation. 


2. 
Direct addressing. New 16-bit addresses present in the instruction in the form 
of immediate 
data are loaded into the program counter and the CS register. 


This is referred to as an intersegment 
operation. 


3. 
Indirect addressing. 
Any of the data memory addressing 
options (which we 
will describe next) may be used to read data from data memory. However, the 
data input is interpreted 
as a memory address by the Jump or Call instruction. 
You have two indirect addressing options. A single 16-bit data word may be 
read, in which case it is loaded into the program counter and the Jump or Call 
references a memory location within the current CS segment. 
You can also 
read two 16-bit data words: the first is loaded into the program counter and 
the second is loaded into the CS Segment register. Thus you can Jump or Call 
any addressable 
memory location using indirect addressing. 


The 8086 offers a wide variety of addressing; 
we will condense 
it into six basic 
options. These options are: 


1. 
Immediate 
2. 
Direct 
3. 
Direct, Indexed 


4. 
Implied 
5. 
Base Relative 


6. 
Stack 


In this form 
of addressing, 
one of the 
operands 
is present 
in the 
byte(s) 
immediately following the instruction 
object code (op-code). If addressing bytes follow 
the op-code, then the immediate data will follow the addressing bytes. For example: 


ADD 
AX. 3064H 


requests the Assembler to generate an ADD instruction which will add 306416 to the AX 
register. This may be illustrated 
as follows: 


Data 
Memory 


Program 
(Relative to the 


Memory 
CS Register) 


Note that the 16-bit immediate 
operand, 
when stored in program memory, 
has 
the low-order byte preceding the high-order 
byte. This is consistent 
with the way the 
8080A stores immediate 
operands in program memory. 
In addition, 
this is consistent 
with the way the 8086 stores 16-bit operands in data memory. When a 16-bit store is per- 
formed, the low-order 8 bits of data are stored into the low-order memory byte, and the 
high-order 
8 bits of data are stored into the succeeding memory byte. 
In this example, the two bytes immediately following the op-code for the ADD to 
AX instruction 
are added to the AX register. 


The 8086 implements 
straightforward 
direct memory addressing by adding a 16- 
bit displacement, 
provided by two object code bytes, to the Data Segment register. The 
sum becomes the actual memory address. This may be illustrated as follows: 


0 
Program 


07 
0 
Memory 


ppppm 


ppppm + 1 


hh 
ppppm + 2 


ppppm + 3 


0 


15 


CS 


H. L, M, N, P, Rand S all represent any hexadecimal digits . 


• Actual data memory address output for direct memory addressing. 


Note that a 16-bit address displacement, 
when stored in program memory, has the 
low-order byte preceding the high-order byte. This is consistent with the way the 8080A 
stores addresses in program memory. 
DS must 
provide 
the segment 
base address 
when 
addressing 
data memory 
directly, as illustrated 
above. 


L 


Direct, indexed addressing 
is allowed by specifying the SI or DI register as an 
index register. You have the option of adding an 8-bit or 16-bit displacement 
to the con- 
tents of the specified index register in order to generate the effective address. 


A 16-bit displacement 
is stored in two object code bytes; the low-order byte of the 
displacement 
precedes the high-order 
byte of the displacement, 
as illustrated for direct 
memory addressing. 
If an 8-bit displacement 
is specified, then the high-order 
bit of the 
low-order byte is propagated into the high-order 
byte to create a 16-bit displacement. 


This may be illustrated a follows: 


15 
7 


AX 
= AH 
+ ALl 


BX 
= BH + BL >------- 


CX 
= CH + CL ------ 


DX 
= DH 
+ DL ------ 


15 


5P 


BP 


0 
Program 


07 
0 
Memory 


I 
I 


ppppm 


ppppm + 1 


II 
ppppm + 2 


hh 
ppppm + 3 


0 


15 


C5 


M, N, P, R. X. H, L. and Z all represent any hexadecimal digits. 


YYYY is the 16-bit or B-bit displacement 
taken from program memory. 


XXXX is the index taken from either the DI or 51 register. 


Implied memory addressing is implemented 
on the 8086 as a degenerate 
version 
of a direct, indexed memory addressing. If you do not specify a displacemen t when using 
the direct, indexed addressing mode, then you have, in effect, implied memory address- 
ing via the SI or DI register. This may be illustrated as follows: 


0 
Program 


07 
0 
Memory 


ppppm 


ppppm + 1 


ppppm + 2 


ppppm + 3 


0 


(You may substitute 
CS. SS or ES for OS by executing 
an additional 
1-byte instruction.) 


X, Rand S represent any hexadecimal 
digits. 


The 8086 implements 
base relative addressing 
in two ways: 


• 
Data memory base relative addressing, 
which is within the OS segment 
(data 


memory) 


• 
Stack base relative addressing, 
which is in the SS segment 
(stack memory) 


Data memory base relative addressing 
uses the BX register contents 
to provide 
the base for the effective address. All of the data memory addressing options thus far de- 
scribed, with the exception 
of immediate 
addressing, 
are available with base relative 
data memory addressing. 
In effect, base relative data memory addressing merely adds 
the contents of the BX register to the effective memory address which would otherwise 
have been generated. Here, for example, is an illustration of base relative direct address- 
ing: 


15 
7 


AX = AH + AL 


BX = BH + BL 
kk 


CX = CH + CL 


DX = DH + DL 


15 


SP 


BP 


SI 


DI 


PC 


15 


CS 


DS 


SS 


ES 


0 
Program 


07 
0 
Memory 


Data 
Memory 
Address 
Calculation 
ppppm 


kk 
II 
ppppm 
+ 1 


hh 
ppppm 
+ 2 


ppppm 
+ 3 


0 


Simple, direct addressing, 
which we described earlier, always generated 
a I6-bit 
displacement. 
Base relative, direct addressing allows the displacement, 
illustrated above 
as HHLL, to be a I6-bit displacement, 
an 8-bit displacement 
with sign extended, 
or no 


displacement 
at all. 


Base relative implied memory 
addressing 
simply adds the contents 
of the BX 
register to the selected Index register in order to compute the effective memory address. 
This may be illustrated as follows: 


15 
7 


AX = AH + AL 


BX = BH + BL 
kk 


CX = CH + CL 


DX = DH + DL 


15 


SP 


BP 


SI 


DI 


PC 


15 


CS 


DS 


SS 


ES 


0 
Program 


07 
0 
Memory 


ppppm 


kk 
ppppm + 1 


ppppm + 2 


ppppm + 3 


0 


Data Memory 
Address 
Calculation 


Base relative, direct, indexed data memory addressing may appear to be compli- 


cated, but in fact it is not. We simply add the contents of the BX register to the effective 
memory address, as computed 
for normal direct, indexed addressing. Thus, base rela- 
tive, direct, indexed data memory addressing may be illustrated as follows: 


15 
7 


AX = AH + AL 


BX = BH + BL 
kk 


CX = CH + CL 


DX = DH + DL 


15 


5P 


BP 


51 


DI 


PC 


15 


C5 


D5 


55 


E5 


Program 
Memory 


Data 
Memory 
Address 
Calculation 


ppppm 
+ 1 


ppppm 
+ 2 


The index xxxx in the illustration above is optional. Base relative, direct memory 
addressing 
is also available. 
In this instance neither 
SI nor DI will contribute 
to the 
address computation, 
and Oxxxx must be removed from the illustration. 


The 8086 also has stack memory addressing 
variations of the base relative, data 


memory addressing options just described. In this case, however, the BP register is used 
as the base register. Here, for example, is base relative, direct stack addressing: 


Program 
Memory 


ppppm 
+ 1 


ppppm 
+ 2 


15 


CS 


In the illustration above, the displacement 
HHLL is present, either as a 16-bit displace- 
ment 
or as an 8-bit displacement 
with sign extended. 
Base relative 
stack memory 
addressing requires a displacement 
be specified, even if zero. 


The 8086 obviously otTers an extensive 
selection of addressing modes. The next 
question is: how are these addressing modes implemented 
in the object code? The 8086 


specifies most data memory addressing modes in an instruction's 
object code using one 
byte of object code, known as the addressing mode byte. The addressing mode byte may 
have one or two additional displacement 
bytes associated with it. The addressing mode 
byte is always the second byte of the instruction 
object code, unless a prefix instruction 
has been included prior to the initial object code. The addressing 
mode byte may be 


illustrated as follows: 


mod 
reg 
rim 
.--..-------- 
= 
---------.- 


xx is two bits that form the mod field. The mod field is used 
to distinguish between memory and register addressing. and 
in the case of memory addressing. specifies how many dis- 
placement bytes follow 
the addressing mode byte. 


yyy is three bits that form the reg field. The reg field defines 
which 
register 
will 
be used in the operation. In addition. 


these three bits may be used to specify instruction. 


zzz is three bits that form the rim field. The rim field is used 
in conjunction with the mod field to specify the addressing 
mode. 


Memory addressing mode. rim specifies the exact addressing option. 


There are no displacement 
bytes. 


Memory addressing mode. rim specifies the exact addressing option. 


There is one displacement 
byte. This displacement 
byte is viewed 
as a signed number in the range + 127 to -128. 
When this num- 
ber is used in the memory address calculation, the number is sign 
extended 
to 16 bits. In this case, the addressing mode bytes can 
be illustrated as follows: 


where mod = 01 and disp is the 8-bit signed displacement 
value. 


Memory addressing mode. rim specifies the addressing option. There 


are two displacement 
bytes. The first displacement 
byte is the 
low-order 
eight bits of the displacement. 
The second displace- 


ment byte is the high-order eight bits of the displacement. 
When 
this number is used in the memory address calculation, the num- 
ber is treated 
as an unsigned 
16-bit number. 
In this case, the 
addressing mode bytes can be illustrated as follows: 


where mod = 10, disp low is the low-order eight bits of the dis- 
placement, 
and disp high is the high-order 
eight bits of the dis- 


placement. 


11 
register addressing 
mode. rIm specifies a register. Used in conjunc- 


tion with the w bit to determine 
if an 8- or 16-bit register 
is 
selected. 


reg 
reg is used in conjunction with another bit, the w bit, in the selection 
of the register to be used in the operation. The w bit, which is part 
of the instruction 
op-code, selects whether an 8- or 16-bit opera- 


tion is performed. 


reg 
w = 0 
w = 1 


000 
AL 
AX 
001 
CL 
CX 
010 
DL 
DX 
011 
BL 
BX 
100 
AH 
5P 
101 
CH 
BP 
110 
DH 
51 
111 
BH 
DI 


rIm 
rIm 
specifies 
the 
addressing 
mode 
in conjunction 
with mod, 
as 


follows: 


mod - 11 


rIm 
mod - 00 
mod - 01 
mod - 10 


w = 0 
w = 1 


000 
BX + 51 
BX + 51 + DI5P 
BX + 51 + DI5P 
AL 
AX 


001 
BX + DI 
BX + DI + DI5P 
BX + DI + DI5P 
CL 
CX 


010 
BP + 51 
BP + 51 + DI5P 
BP + 51 + DI5P 
DL 
DX 
011 
BP + DI 
BP + DI + DI5P 
BP + DI + DI5P 
BL 
BX 


100 
51 
51 + DI5P 
51 + DI5P 
AH 
5P 


101 
DI 
DI + DI5P 
DI + DI5P 
CH 
BP 


110 
Direct 
BP + DI5P 
BP + DI5P 
DH 
51 


Address 


111 
BX 
BX + DI5P 
BX + DI5P 
BH 
DI 


This table is self-explanatory, 
with the exception of Direct Address. When mod is 


00 and rIm is 110, the offset address is taken directly from the two bytes that follow the 
addressing 
mode byte. This can be illustrated as follows: 


where mod is 00, rIm is 110, addr-high is the high-order 
8 bits of the offset address and 
addr-Iow is the low-order 8 bits of the offset address. 


Every addressing mode has a standard default segment register. In most cases you 


can select an alternative segment register by using a segment override prefix. To use the 
prefix, place the following byte in front of the instruction whose default segment register 
assignment 
is to be overriden. 


~ 
T 
rr is two bits that select the segment register to be used in 
the following 
instruction. 


rr = 00 for the ES register 
01 for the C5 register 
10 for the 55 register 
11 for the D5 register 


In three cases, the segment override may not be used. They are: 


1. 
Stack reference 
instructions 
(e.g., PUSH and CALL) 
that use the stack 


pointer (SP register) to compute the offset always use the SS register as the 
segment register. 


2. 
String instructions 
that use the DI register always use the ES register as the 
segment register. In a string operation where both SI and DI are used 
(e.g., 
MOYS or CMPS), 
a segment 
override 
prefix, if present, 
overrides 
the SI 
offset's segment register. 


3. 
Segment override prefixes cannot be used with program memory addressing. 
All instruction 
fetches are relative to the CS Segment register. 


Memory addressing modes and memory addressing byte information can be com- 
bined and summarized 
as follows: 


rIm = 
mod = 
00 
mode = 01 
mod = 10 


000 
Base Relative 
Indexed 
Base Relative 
Indexed 
Base Relative 
Direct 


BX + SI 
BX + SI + DISP 
Indexed 


BX + SI + DISP 


001 
Base Relative 
Indexed 
Base Relative 
Direct 
Base Relative 
Direct 


Indexed 
Indexed 


BX + 01 
BX + 01 + DISP 
BX + 01 + DISP 


010 
Base Relative 
Indexed 
Base Relative 
Direct 
Base Relative 
Direct 


Stack 
Index ed Stack 
Indexed 
Stack 


BP + SI 
BP + SI + DISP 
BP + SI + DISP 


011 
Base Relative 
Indexed 
Base Relative 
Direct 
Base Relative 
Direct 


Stack 
Indexed 
Stack 
Indexed 
Stack 


BP + 01 
BP + 01 + DISP 
BP + 01 + DISP 


100 
Implied 
Direct, 
Indexed 
Direct, 
Indexed 


SI 
SI + DISP 
SI + DISP 


101 
Implied 
Direct, 
Indexed 
Direct, 
Indexed 


01 
01 + DISP 
01 + DISP 


110 
Direct 
Base Relative 
Direct 
Base Relative 
Direct 


Direct 
Address 
Stack 
Stack 


BP + DISP 
BP + DISP 


111 
Base Relative 
Base Relative 
Direct 
Base Relative 
Direct 


BX 
BX + DISP 
BX + DISP 


Note that two operand instructions 
will very frequently access one operand out of 
memory, 
while the other operand is in a CPU register. Also, both operands 
will fre- 
quently be accessed out of CPU registers, The 8086 does not allow both operands to be 
accessed out of memory, with the exception of several special data string manipulation 
instructions. 
The following options are available: 


Source Operand 


CPU Register 
Memory Location 
CPU Register 


Destination 
Operand 


CPU Register 
CPU Register 
Memory Location 


Result 


CPU Register 
CPU Register 
Memory 
location 


In the following section, each 8086 assembly language instruction 
is discussed. 
The format for each description 
is composed of six distinct parts: 


1. 
The instruction 
mnemonic and the various operands associated with it. Varia- 
ble operands are signified by lower-case letters. The mnemonic 
itself and any 
fixed operands are signified by capital letters. Here is an example: 


IN 
ac.DX 
11 
t 
,,,., .,"""' ,"eo""" 
Variable 
in lower case 


Mnemonic 
in capitals 


2. 
A description 
of the instruction's 
operation. 


3. 
The machine language encoding of the instruction. 


4. 
An example of the instruction's 
operation. This is not present for some very 
simple instructions. 


5. 
A diagram of the instruction's 
execution, 
which shows the effect the instruc- 
tion has on the 8086 flags, registers, and memory. 


6. 
A Notes section that includes assorted information 
such as short examples of 
how the instruction 
might be used, or related instructions 
that might be more 
effective in particular instances. 


These are the abbreviations 
used for the operands described with the mnemonics: 


Either the AL register, if an 8-bit operation 
is specified, or the AX 


register, if a 16-bit operation is specified. This will be represented 
in an 8086 assembly language instruction 
by AL or AX. 


An 8086 address composed 
of two 16-bit addresses, 
a 16-bit offset 
address and a 16-bit segment 
address. 
Typically, this is repre- 
sented by a label in an 8086 assembly language instruction. 


Either 1 or the contents of the CL register. This will be represented 
by 


1 or CL in an 8086 assembly language instruction. 


8 or 16 bits of immediate data. This can appear as any of a wide selec- 


tion of numeric representations 
or expressions in an 8086 assem- 


bly language statement. 


8-bit signed binary displacement 
used by the Jump and Jump-on- 
Condition 
instructions. 
Invariably this will be represented 
by a 


label in an 8086 assembly language instruction. 


mem/reg 


port 


16-bit binary 
displacement 
used 
by the 
Call, Jump, 
and 
Return 


instructions. 
When used in the Call and Jump instructions, 
this is 


almost always represented 
by a label. The Return instruction 
will 


typically use a numeric expression 
to represent 
disp 16. Its use 
with the Return instruction 
will be shown. 


Memory operand. The addressing mode used to select the operand is 


specified 
by the addressing 
mode 
byte. This will typically be 


represented 
by a label, in which case the assembler will select the 


appropriate addressing mode byte, or a sequence of symbols that 
allows the selection of a specific addressing 
mode byte. 


Memory or register operand. Consult descriptions 
for mem and reg. 


An I/O port. This will be represented 
by a numeric representation 
or 


an expression. The port number must be beween 016 and FFI6. 


Register AH, AL, BH, BL, CH, CL, OH, or OL if an 8-bit operation 


is specified; register AX, BX, CX, OX, SP, BP, SI, or DI if a 16- 
bit operation is specified. 


Register CS, OS, ES, or SS. 


These are the abbreviations 
used in describing the instruction's 
encoding. 


One bit used in the shift and rotate instructions 
selecting either 
1 or 


the contents of the CL register to be the number of shifts/rotates 
to be performed. 


c = O. 
Shift/rotate 
once 
c = 1. 
shift/rotate 
the number of times 
specified by the CL register. 


One bit used to specify the direction 
in which an operation 
is per- 


formed. 


8 bits used as a signed binary displacement 
by the Jump and Jump-on- 


Condition 
instructions. 


Two hexadecimal digits, used to represent immediate data or part of a 


16-bit displacement. 


Two hexadecimal digits, used to represent immediate data or part of a 


16-bit displacement. 


8-bit addressing mode byte that is described in earlier in this chapter. 


Three bits selecting one of the 8086 general-purpose 
registers 


IF 


rrr = 000 for AL 
001 for CL 
010forOL 
all 
for BL 
100 for AH 
101 for CH 
110 for OH 
111 for BH 


rrr = 000 for AX 
001 for CX 
010 for OX 
all 
for BX 


100 for SP 
101 for BP 
110 for SI 
111 for 01 


The following symbols are used in the example use of instructions: 


This will appear at the end of a group of digits to specify that the digits 


be treated as hexadecimal 
digits. 


These 
are used to indicate 
the contents 
of the memory 
location 


addressed by the expression inside the brackets. Suppose that the 
BX register contains 054A16. The expression 


[BX] 


refers to the memory 
location 
that has an offset address 
of 


054A 16 in the current data segment. 


Are all used to represent 
one hexadecimal 
digit. For example, 


jjkk 


g,hj,k,m,n,p, 
q,r,s,t,u,v,w, 
x,y,z 


One bit indicating 
whether 
or not immediate 
data is to be sign 
extended. 
If a 16-bit operation with immediate data is specified, it 
is possible that the immediate 
operand 
can be expressed 
using 
just 
one byte of program 
memory 
space. s is interpreted 
as 


follows: 


s = O. 
Two bytes are necessary for the immediate 
data. 


no sign ex tension is performed. 


s = 1. 
One byte of immediate 
data is present. 


To form the sixteen 
bits of immediate 
data necessary for the operation. 
sign 
ex tend the high-order bit of the 
immediate 
data byte. 


ss = 
00 for ES 
01 for CS 
10 for SS 
11 for DS 


One bit indicating the location to which a software interrupt should be 


vectored. If v = 0, then the interrupt service routine is located at 
the address specified at location OOOOC 16' otherwise the address is 
determined 
by the succeeding byte. 


One bit indicating whether an 8- or 16-bit operation 
is performed. 


w = 0 
B-bit operation 
w = 1 
16-bit 
operation 


Three don't care bits. 


Two hexadecimal digits indicating the I/O port number to be used by 


the instruction. 


is used to represent 
a 16-bit data element; 


ppppm 


is used to represent 
a 20-bit address. 


EA 
Effective 
address. 
EA appears 
in calculations 
for the number 
of 


execution cycles required by individual instructions. 
EA specifies 


addressing 
mode 
execution 
cycles, which must 
be added 
as 


follows: 


Direct 
Addressing 
Direct. 
Indexed 
Addressing 
Implied 
Addressing 
Base Relative 
Addressing 
Base Relative 
Direct 
Addressing 
Base Relative 
Indexed 
Addressing 
Base Relative 
Direct 
Indexed 
Addressing 


ADD 6 cycles 
ADD 9 cycles 
ADD 5 cycles 
ADD 5 cycles 


A segment 
Override 
Prefix 


is present 
ADD 2 cycles 


A 16-bit 
word 
is addressed 
and the word 
resides at an 
odd memory 
address 
ADD 4 cycles 


BP + 51 and BX + DI modes 
require one more clock than 
BP + DI and BX + 51 modes. 


8086 
ASSEMBLY 
LANGUAGE 
INSTRUCTIONS 
ORGANIZED 
ALPHABETICALLY 


This instruction 
is used to adjust a result in the AL register, assuming this result 
was generated 
by adding two ASCII characters 
as operands. 
The adjustment 
is per- 
formed in the following manner: 


1. 
If the low-order four bits of the AL register are between 0 and 9 and the AF 
flag is 0, then go to Step 3. 


2. 
If the low-order four bits of the AL register are between A and F or the AF 
flag is 1, then add 6 to the AL register, add 1 to the AH register, and set the 
AF flag to 1. 


3. 
Clear the high-order 
four bits of the AL register. 


The encoding for this instruction 
is: 


AAA 
37 


For example, suppose that the AX register contains 053516 and the BL register contains 
3916, Executing the sequence 


ADD 
AL.BL 
AAA 


would result in AX containing 060416, The ADD instruction results in 


3516=0011 
0101 
3916 = 0011 
1001 


0110 
1110 


6E being stored into AL. The AAA instruction 
performs steps 2, 3, and 4 of the adjust- 
ment algorithm, which results in the AF and CF flags being set to 1,04 being stored into 
the AL register, and the AH register being incremented 
to 06. 


Data 


0 
D 
T 
S 
Z 
A 
P 
C 
Memory 


PSW 


AX 
yy 


BX 


CX 


DX 


Program 
(Relative 
to the 


Memory 
CS Registerl 


SP 
37 
ppppm 


BP 
ppppm + 1 


SI 
ppppm + 2 


DI 
ppppm + 3 


PC 
mm 
mm 


CS 
nn 
nn 


DS 
Program Memory 


SS 
Address Calculation 


ES 


AAA 
Number of cycles: 
4 


1. 
Note that this instruction 
would also work if the two operands were one-digit BCD 
numbers. 
Why one should desire to do this sort of operation is left up to the reader. 


2. 
To perform corrections on the addition of two packed BCD digits, refer to the DAA 
instruction. 


3. 
As a result of this instruction, 
the values of the Overflow, Parity, Sign, and Zero 
flags are undefined. 


This instruction 
assumes 
that the AH and AL registers contain unpacked 
BCD 
operands. 
This instruction 
converts this information 
into a binary operand in the AL 
register. The algorithm for conversion 
assumes that the tens digit is in the AH register 
and the units digit is in the AL register. The AAD algorithm is as follows: 


1. 
Multiply the contents of the AH register by OAI6. 


2. 
Add AH to AL. 


3. 
Store 0016 into the AH register. 


4. 
Set the flags in the following manner: 


Carry, Overflow, Arithmetic: 
undefined 


Parity: based on the AL register 
Zero: based on the AL register 
Sign: based on the high-order 
bit of the AL register 


The encoding for this instruction 
is: 


AAO 
...-.....-- 


050A 


Suppose that the AX register contains 060416• After the instruction 


AAO 


has executed, the AX register will contain 004016• The flags will be set as follows: 


Carry: undefined 
Overflow: undefined 
Arithmetic: 
undefined 


Sign: high-order 
bit of AL register is 0, set Sign to 0 


Zero: AL register is non-zero, 
set Zero to 0 


Parity: one 1 bit in AL register, set Parity to 0 


AAD 
Number of cycles: 60 


1. 
This instruction 
can also be used to adjust ASCII operands for division. For exam- 


ple, consider the case where the AX register contains 353716, After the instructions 


AND 
AX,OFOFH 
AAD 


This instruction 
adjusts a result in the AL register, assuming that a multiplication 
has been performed 
with two unpacked BCD numbers 
as operands. The adjustment 
is 
performed 
as follows: 


1. 
Divide the AL register by OAI6. Store the quotient in the AH register. Store 
the remainder 
in the AL register. 


2. 
Set the flags in the following manner: 


Carry, Overflow, and Arithmetic: 
undefined 


Parity: based on the AL register 
Sign: based on the high-order 
bit of the AL register 


Zero: based on the AL register 


The encoding for this instruction 
is: 


AAM 
'-v-' 
D40A 


Suppose that the AL register contains 0716 and the BL register contains 0916, After the 
sequence of instructions 


the AX register will contain 060316, The MUL instruction 
results in 3FI6 being stored 
into the AL register. Performing 
steps 1 and 2 of the adjustment 
algorithm results in 
060316 in the AX register, and the flags are set in the following manner: 


Carry: undefined 
Overflow: undefined 
Arithmetic: 
undefined 


Sign: high-order 
bit of AL is 0, set Sign to 0 


Zero: the AL register is non-zero, 
set Zero to 0 


Parity: two 1 bits in the AL register, set Parity to 1 


Data 


0 
0 
T 
5 
Z 
A 
P 
C 
Memory 


P5W 


AX 
xx 


BX 


CX 


OX 


Program 
(Relative 
to the 


Memory 
C5 Register) 


5P 
04 
ppppm 


BP 
DA 
ppppm + 


51 
ppppm + 2 


DI 
ppppm + 3 


PC 
mm 
mm 


::~ 


nn 


I 


nn 


Program Memory 
Address Calculation 


E5 


AAM 
Number of cycles: 
B3 


This instruction 
adjusts a result in the AL register, 
assuming that a subtraction 
has been performed 
with two ASCII characters 
as operands. 
The adjustment 
is per- 
formed as follows: 


1. 
If the low-order four bits of the AL register are between 0 and 9 and the AF 
flag is 0, then go to Step 3. 


2. 
If the low-order four bits of the AL register are between A and F or the AF 
flag is 1, then subtract 
6 from the AL register, 
subtract 
1 from the AH 
register, and set the AF flag to 1. 


3. 
Clear the high-order 
four bits of the AL register. 


4. 
Set the CF flag to the value of the AF flag. 


The encoding for this instruction 
is: 


AAS-.- 
3F 


For example, 
suppose 
that the AX register contains 
043816, 
After the sequence 
of 
instructions 


3816 : 
0011 
1000 


Twoscompof3516:1100 
1011 


0000 
0011 


0316 being stored into AL. The AAS instruction 
performs steps 1 and 3 of the adjust- 
ment algorithm, 
which in this case does not modify the AX register. The AF and CF 
flags are set to O. 


Data 
0 
D 
T 
S 
Z 
A 
P 
C 
Memory 


PSW 


AX 
xx 


BX 


CX 


DX 


Program 
(Relative 
to the 


Memory 
CS Register) 


SP 
3F 
ppppm 


BP 
ppppm 
+ 1 


SI 
ppppm 
+ 2 


DI 
ppppm 
+ 3 


PC 
mm 
mm 


CS 
nn 
nn 


DS 
Program 
Memory 


SS 
Address 
Calculation 


ES 


AAS 
Number 
of cycles: 
4 


1. 
To adjust the results of an ASCII addition, consult the AAA instruction. 
To adjust 
the results of packed BCD addition and subtraction, 
consult the DAA and DAS 
instructions. 


2. 
The values of the Parity, Zero, Sign, and Overflow flags are undefined following the 
execution 
of this instruction. 


This instruction 
is used to add the immediate data present in the succeeding pro- 
gram memory 
bytes and the Carry status to the AL (8-bit operation) 
or AX 06-bit 
operation) 
register. 


The encoding for this instruction 
is: 


ADC 


0001010w 
~ 
L 
High-order 8 
bits 
of 
the 
immediate 


operand.This byte is only present if w = 
1. 


Low-order 
8 
bits 
of 
the 
immediate 


operand. This byte is always present. 


w = a 
8-bit operation. AL is one of 


the operands and the destination for the 
result. 


w = 1 
16-bit operation. AX is one of 


the operands and is the destination for 
the result. 


Consider, 
for example, the case where the AX register contains 4F3DI6 
and the Carry 
status is 1. After the instruction 


4F3D16 =0100 
1111 
0011 
1101 


FD8116=1111 
1101 
1000 
0001 


Carry Status = 
1 


0100 
1100 
10111111 


11 


t 
t 
7 one bits, set P to a 
Arithmetic status is set to a 
. 
sign bit is set to a 


------------Carry 
status is 1 
Overflow flag is set to 0, 


Zero Status is set to a 
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Data 


0 
D 
T 
5 
Z 
A 
P 
C 
Memory 


P5W 


AX 
xx 
yy 


BX 


CX 


DX 


Program 
(Relative 
to the 


Memory 
C5 RegISter! 


5P 
15 
ppppm 


BP 
kk 
ppppm + 1 


51 
jj 
ppppm + 2 


DI 
ppppm + 3 


PC 
mm 
mm 


C5 
nn 
nn 


D5 
Program Memory 


55 
Address Calculation 


E5 


ADC 
AX,jjkk 
Number of cycles: 
4 


Notes: 


1. 
This instruction 
performs the same function as the 8080 instruction 
ACI data. In 


addition, this instruction 
offers a 16-bit Add With Carry Immediate 
option. 


This instruction 
is used to add immediate data present in the succeeding program 


memory byte(s) and the Carry status to the specified register or memory location. An 8- 
bit or 16-bit operation 
may be specified. 
The encoding for this instruction 
is: 


AOC 


100000sw 


memlreg, 


mod 010 rim 


data 


[ T,;,"_",,,,,,,,.,,". 
"-M 


immediate operand, This byte 
is only present if s = 0 and 
w = 1. 


l----------Low-order 
byte 
of 
the 


immediate 
operand. This byte 


is always present. 


Addressing mode byte(sl. As 
described earlier in this chapter. 


w = 0 
a-bit operation 


w = 1 
16-bit operation 


s is the sign extension bit. If 
w = 0, this bit is ignored. If 
w = 1 then 


s = 
0, all 16 bits of the 


immediate operand 
are present. 


s = 1. only the low-order 
a 


bits of the immediate 
operand are present. 
The high-order a bits 
of the 16- bit operand 
are formed 
by sign 


extending the high- 
order bit of kk. 


Suppose that the DS register contains E40016, 
the SI register contains 004016, 
the word 
at memory location E404016 is 689016, 
and the Carry status is O.After the instruction 


AOC 
[51!. 2031 H 


executes, the word at memory location E4040 16 will contain 98C 116 and the Carry status 
will be O. 


689016 
= 0110 
1011 
1001 
203116=0010 
1101 
0011 
Carry status = 


Data 
0 
0 
T 
5 
Z 
A 
P 
C 
Memory 


P5W 


yy 


~~I 


xx 


Program 
(Relative 
to the 


Memory 
C5 Register! 


5P 
Data Memory 
81 
ppppm 
Address Calculation 


BP 
14 
ppppm + 1 


51 
gg 
gg 
kk 
ppppm + 2 


01 
jj 
ppppm + 3 


PC 
mm 
mm 


C5 
nn 
nn 


05 
hh 
hh 
Program Memory 


55 
Address Calculation 


E5 


AOC 
[51),jjkk 


Number of cycles: Immediate to memory: 
17 + EA 
Immediate to register: 4 


1. 
This instruction 
is not normally used to ADC immediate 
data to the AL or AX 
register. The instruction 
ADC ac,data is provided for that purpose. 


2. 
Segment registers may not be specified as operands in this instruction. 


Register to Register 
Register to Memory 
Memory to Register 


Add the contents 
of the register or memory location specified by mem/reg2 
and 
the Carry status to the contents 
of the register or memory location specified by mem/ 
reh 
An 8- or 16-bit operation may be specified. Either mem/reg\ 
or mem/reg2 
may be a 
memory operand, 
but one of the operands must be a register operand. 
The encoding for this instruction 
is: 


ADC 
mem/reg 1, mem/reg2 


000100dw 


Addressing mode byte(s) as described 
earlier in this chapter. 


w =0 
8-bit operation 
w = 1 
16-bit operation 


d is the direction flag. If d = 0, then the 
operand described by mod and rim is 
mem/reg 1 and the operand described 
by reg is mem/reg2. If d = 1, then the 
operand described by mod and rim is 
mem/reg2 and the operand described 
by reg is mem/reg 1. 


Suppose that the AX register contains 0211\6, the BX register contains 008416, the OS 
register contains 1COOI6,the Carry status is 1, and the contents of the memory word at 
1C084\6 are 00A416• After the instruction 


ADC 
AX,IBX] 


Carry = 
1 
021116=0000 
0010 
0001 
0001 
00A416 
= 0000 
0000 
1010 
0100 


1 


0 


1 


29 


1 


_00 
_00_10 _101_'f_ 


11 
t__ 
5 one bits. set P to 0 


No carry, set AF to 0 


Overflow is 0 


------------Si9n 
Status is 0 


------------ 
No carry, set C to 0 
Non-zero result, set Z to 0 
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Data 


0 
D 
T 
S 
Z 
A 
P 
C 
Memory 


PSW 


ww 
rrrrg 


AX 
xx 
yy 
vv 
rrrrg + 1 


BX 
gg 
gg 
rrrrg + 2 


CX 


DX 


Program 
(Relative 
to the 


Memory 
CS Registerl 


SP 
13 
ppppm 


BP 
07 
ppppm + 


SI 
ppppm + 2 


DI 
ppppm + 3 


PC 
mm 
mm 


CS 
nn 
nn 


DS 
hh 
hh 


SS 


ES 


ADC 
AX,[BX] 
Number of cycles: 
Memory to register: 
9 + EA 
Register to memory: 
16 + EA 
Register to register: 
3 


Notes: 


1. 
This instruction 
is not normally used to ADC to the AX or AL registers. The ADC 
ac,data instruction 
accomplishes 
that function in fewer bytes. 


This instruction 
is used to add the immediate data present in the succeeding pro- 
gram memory byte(s) to the AL (8-bit operation) 
or AX 06-bit 
operation) 
register. 
The encoding for this instruction 
is: 


ep__ 
I 
f 
IH;,"~""'''8 ", 
., 
", ;mm"'''. 
L 
operand.This byte is only presentif w = 1. 


Low-order B bits of 
the immediate 


operand.This byte is always present. 


w = 0 B-bit operation.AL is one of the 
operands and the destination for the 
result. 


w = 1 16-bit operation.AX is oneof the 
operandsand is the destination for the 
result. 


406416 = 0100 
0000 
0110 
0100 
OFOF16= 0000 
1111 
0000 
1111 


0100 
1111 
0111 
0011 


11 


1 
t 
t 
5 one bits, set P to 0 


Carry out of bit 3, set AF to 1 


------------No 
carriesout of bits 14 or 15, set 0 to 0 


_-------------0 
sets S to 0 
------------No 
Carry sets C to 0 


Non-zeroresult, set Z to 0 


ADD 
AX.jjkk 
Number of cycles: 4 


Notes: 


1. 
This instruction 
performs the same function as the 8080 instruction 
ADI data. This 
instruction 
has the additional capability of adding 16-bit immediate data elements. 


This instruction 
is used to add the immediate data present in the succeeding pro- 
gram memory 
byte(s) 
to the specified register or memory 
location. An 8- or 16-bit 
operation may be specified. 
The encoding for this instruction 
is:L 


k 


k 
_c=c=J 
L 
High-order byte of the 16-bit 
immediate operand. This byte is 
only present if s = a and w = 1 


Low-order 
byte of the 


immediate 
operand. This byte 


is always present. 


Addressing mode byte (s) as 
described earlier in this 
Chapter. 


w = 0 
a-bit operation 


w = 1 
16-bit operation 


s is the sign extension bit. If w = 
O. this bit is ignored. If w = 1 
then 


s = O. all 16 bits of the immedi- 
ate operand are present. 


s = 1. only the low-order a bits 
of the immediate 
operand are 


present. The high-order 
a bits 


of 
the 
16-bit 
operand 
are 


formed 
by sign extending the 


high-order bit of kk. 


For example, if the DX register should contain 465216 and the instruction 


ADD 
DX,OFOFOH 


is executed, then the DX register contents will be altered to 374216, 


465216 
= 0100 
0110 
0101 
0010 
FOF016 = 1111 
0000 
1111 
0000 


0011 
0111 
0100 
0010 


1 
1 1 
2 one bits, set Parity flag to 1 
Nonzero result, set Z to a 


No carry out of bit 3, 
set AF to a 


Carry out of high-order bit, 
set Carry status to 1 


I '-------------- 
Sign bit is 0, set Sign status 
to a 
L- 
Overflow is set to a 


ADD 
DX,jjkk 
Number of cycles: to memory: 17 + EA 
to register: 4 


1. 
This instruction 
is not normally used to ADD to the AX or AL registers, The ADD 
ac,data instruction 
accomplishes that function in fewer bytes, 


mem/reg" mem/reg2 


Register to Register 
Register to Memory 
Memory to Register 


Add: 1. 
2. 
3. 


Add the contents of the register or memory location specified by mem/reg2 
to the 


contents 
of the register or memory 
location specified by mem/regl• 
An 8- or 16-bit 


operation may be specified. Either mem/regl 
or mem/reg2 
may be a memory operand, 
but one of the operands must be a register operand. 
The encoding for this instruction 
is: 


ADD 
mem/reg 1. mem/reg2 


o 0 0 0 0 0 d w I 
mod reg rim 
L:= 


Addressing mode byte(sl as described 
earlier in this chapter 


w = 0 
S·bit operation 
w = 1 
16-bit operation 


'----------d 
is the direction flag. If d = 0, then the 
operand described by mod and rim 
is 
memlreg 1 and the operand described 
by reg is mem/reg2. 
If d = 1. then the 
operand described by mod and rim 
is 
mem/reg2 
and the operand described 
by reg is mem/reg 1. 


Suppose that the ex register contents are 002916 and the contents of the SI register are 
04ED16. After the instruction 


ADD 
SI,CX 


has executed, 
the SI register contents and the statuses will be altered as follows: 


002916 
= 0000 
0000 
0010 
1001 


04ED16=0000 
0100 
1110 
1101 


000 
0101 
0001 
0110 


1 
L 


Threeone bits. set 
the Parity Status to 0 


____________ 
carry out of bit 3, set AF to 1 


. 
Set Overflow Status to 0 


Set Sign Status to 0 


No Carry, set C to 0 
Non-zero result, set Z to 0 


ADD 
51.eX 
Number of cycles: Register to Register: 3 
Register to Memory: 
16 + EA 
Memory to Register: 9 + EA 


This instruction 
is used to AND immediate 
data present in the succeeding 
pro- 
gram memory byte(s) with the AL 
(8-bit operation) 
or AX 06-bit 
operation) 
register 
contents. 
The encoding for this instruction 
is: 
L 


k 


k 


__~ 
jj 
High_order a 
bits 
of 
the 
immediate 


operand. This byte is only present if w = 1. 


Low-order 
a 
bits 
of 
the 
immediate 


operand. This byte is always present. 


w = 0 
a-bit operation. 


w = 1 
16-bit operation. 


As an example, consider the case where the AL register contains C316• After the instruc- 
tion 


AND 
AL,7FH 


executes, the AL register will contain 4316, 


C316 = 1100 
0011 
7F16=01111111 


0100 
0011 
T--' t 
3 one bits, set P to 0 


~AF 
flag indeterminate 


Non-zero result, set Z to 0 


Overflow is set to 0 


'--------5ign 
is set to 0 


Carry is set to 0 


AND 
AL,kk 
Number of cycles: 4 


Notes: 


1. 
This instruction 
performs 
the same function 
as the 8080 instruction 
ANI data. 
However, 
it also allows a 16-bit operation. 


2. 
If you desire to AND immediate with any of the other general purpose registers or 
with some memory location, consult the AND mem/reg,data 
instruction. 


L_ 


AND immediate data present in the succeeding program memory byte(s) with the 
specified register or memory location. An 8- or 16-bit operation 
may be specified. 
The encoding for this instruction 
is: 
L: 


kk 
c=eHigh-order 
byte of the 16-bit 


immediate operand.This byte 
is only present if w = 1. 


------- 
Low-order 
8 
bits 
of 
the 


immediate operand. This byte 
is always present. 


Addressing mode byte(s). As 
described earlier in this chapter. 


w = 0 
8-bit operation. 


w = 1 
16-bit operation. 


Consider 
the case where the BX register contains 
010416, 
the OS register contains 
000016, and the byte at memory location 0010416 is 4716, After the instruction 


AND 
[8X!.52H 


has executed, memory location 0010416 will contain 4216, 


4716 = 0100 
0111 
5216 = 0101 
0010 
0100 
0010 


L2 
one bits. set the Parity 


flag to 1 


Non-zero result. set Z to 0 


Carry is cleared to 0 


'---------5ign 
is set to 0 


Overflow is cleared to 0 


AND 
[BXl.kk 
Number of cycles: 
Immediate to memory: 
17 + EA 
Immediate to register: 4 


Notes: 


1. 
This instruction 
is not normally used to AND data with the AX or AL registers. The 
instruction 
AND ac,data is provided for this function. 


AND 


AND: 


mem/reg" 
mem/regz 


Register with Register 
Register with Memory 
Memory with Register 


AND the contents of the register or memory location specified by mem/reg2 
with 
the contents 
of the register or memory location specified by mem/reg" 
returning 
the 
result to mem/regl. 
An 8- or 16-bit operation 
may be specified. Either mem/regl 
or 
mem/reg2 
may be a memory 
operand, 
but one of the operands 
must be a register 
operand. 


The encoding for this instruction 
is: 


AND 
mem/reg 1, mem/reg2 


001000dw 
~ 


Addressing mode byte (s) as described 
earlier in this chapter. 


w = 0 
a-bit operation 
w = 1 
16-bit operation 


d is the direction flag. If d = O. then the 
operand described by mod and rim 
is 
mem/reg 1 and the operand described 
by reg is mem/reg2. 
If d = 1, then the 
operand described by mod and rim 
is 
mem/reg2 
and the operand described 
by reg is mem/regl' 


As an example, consider the case where the DL register contains 0616, the DS register 
contains BOOOI6,the BX register contains 001016, the SI register contains 000616, and the 
byte at memory location B001616 contains F116. After the instruction 


AND 
DL, (BX + SI) 


0616 = 0000 
0110 


Fl16 = 1111 
0001 
0000 .•..... 
0000 
! 1 L,.ro O~ 
";" 
,."" 
, 
AF flag indeterminate 


Carry is cleared to 0 


Set the Sign Status to 0 


Overflow Status is cleared 


Zero result, set Z to 1 
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Data 


0 
D 
T 
5 
Z 
A 
P 
C 
Memory 


P5W 


xx 
rrrrr 


AX 


BX 
gg 
gg 


CX 


DX 
yy 
Program 
(Relative 
to the 


Memory 
C5 Register! 


5P 
22 
ppppm 


BP 
qq 
qq 
10 
ppppm + 


51 
ppppm + 2 


DI 
ppppm + 3 


PC 
mm 
mm 


C5 
nn 
nn 


D5 
hh 
hh 
Program Memory 
Address Calculation 


55 


E5 


AND 
DL, [BX + 51] 
Number of cycles: 
Memory to Register: 9 + EA 
Register to Memory: 
16 + EA 
Register to Register: 3 


Store the contents of the CS and PC registers on the top of the stack, i.e., push the 
address of the instruction 
following the CALL onto the top of the stack. Place the con- 
tents of the succeeding four memory bytes into the PC and CS registers. Place the bytes 
in the following manner: 


1. 
Store the second and third bytes of this instruction 
into the PC register. 


2. 
Store the fourth and fifth bytes of this instruction 
into the CS register. 


The encoding for this instruction 
is: 


CALL 
addr 
-.- 
9A 


~ L High-order 8 bits of the new segment 
address. This byte is stored into the 
high-order byte of the CS register. 


Low-order 8 bits of the new segment 
address. This byte is stored into the 
low-order byte of the CS register. 


High-order 
8 bits of the 
new offset 


address. This byte is stored into the 
high-order byte of the PC register. 


Low-order 
8 
bits of 
the 
new 
offset 
address. This byte is stored into the 
low-order byte of the PC register. 
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T 
S 
Z 
A 
P 
C 


PSW 
~~I 


SP 
ss 
ss 


6P 


SI 


01 


PC 
mm 
mm 


CS 
nn 
nn 


OS 


SS 
tt 
tt 


ES 


Data 
Memory 


mm + 5 


mm 


nn 


nn 


Program 
(Relative 
to the 


Memory 
CS Register! 


9A 
ppppm 


kk 
ppppm + 1 


li 
ppppm + 2 


hh 
ppppm + 3 


gg 
ppppm + 4 


CALL 
addr 
Number of cycles: 
26 


1. 
There are four types of CALLs: 


CALL addr: this instruction, 
intersegment 
CALL 
CALL mem: intersegment 
indirect CALL 


CALL disp: intrasegment 
CALL 


CALL mem/reg: 
intrasegment 
indirect CALL 


Push the address of the instruction 
following the CALL onto the top of the stack. 


Add the contents 
of the next two program memory 
bytes, treating them as a 16-bit 
unsigned displacement, 
to the program counter. Continue 
execution 
from this point. 


The encoding for this instruction 
is: 


CALL displ6 
-...- 
E8 


T,,,"_,",,,, "" 
of 16-bit displacement. 


---------Low-order 
8 bits 
of the 16-bit displacement. 


As an example, consider the following instruction 
sequence: 


CALL 
SUBR 
AND 
AL,7FH 


After the CALL instruction 
has executed, the address of the AND instruction 
will have 
been pushed onto the stack, and the PUSH instruction 
at SUBR will be executed next. 
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Data 
(Relative to the 


0 
0 
T 
S 
Z 
A 
P 
C 
Memory 
S5 Regiserl 


PSW 
uuuus - 
3 


mm + 3 
uuuus - 
2 
;~I 


mm 
uuuus 
- 


uuuus 


Program 
(Relative 
to the 


Memory 
CS Register) 


SP 
ss 
ss 
Ea 
ppppm 


BP 
kk 
ppppm + 


SI 
jj 
ppppm + 2 


01 
ppppm + 3 


PC 
mm 
mm 


CS 
nn 
nn 


OS 


SS 
tt 
tt 


ES 


CALL 
jjkk 
Number of cycles: 19 


1. 
There are four types of CALLs: 


CALL disp: this instruction, 
intrasegment 
CALL 


CALL mem/reg: 
intrasegment 
indirect CALL 


CALL address: 
intersegment 
CALL 


CALL mem: intersegment 
indirect CALL 


Store the contents of the CS and PC registers on the top of the stack, i.e., push the 
address of the instruction 
following the CALL onto the stack. Move the word at the 
specified memory location into the PC register; move the succeeding word into the CS 
register. Continue 
execution 
from this point. 


CALL 
mem 
-.- 
FF 


Addressing mode byte (sl. As described 
earlier in this chapter. 


Suppose the OS register contains 040016, 
the SI register contains 000416, 
the memory 
word at 0400416 is 010016 and the memory word at 0400616 is OFEOI6. 
After the instruc- 
tion 


has executed, the PC register will contain 010016 and the CS register will contain OFEOI6. 
Execution will continue from location OFFOOI6. 


0 
D 


PSW 


~jl 


SP 
ss 


SP 


SI 
gg 


DI 


PC 
mm 


CS 
nn 


DS 
hh 


SS 
It 


ES 


• Stack Data Memory Address Calculation 


•• Program Memory Address Calculation 


••• Data Memory Address Calculation 


CALL 
[SIJ 
Number of cycles: 
37 + EA 


1. 
There are four types of CALLs: 


CALL mem: this instruction, 
intersegment 
indirect CALL 


CALL addr: intersegment 
CALL 


CALL mem/reg: 
intrasegment 
indirect CALL 


CALL disp: intrasegment 
CALL 


2. 
If mod = 11, this operation is undefined. 


Data 
(Relative to the 


Memory 
DS Register) 


yy 
rrrrg 


xx 
rrrrg + 1 


ww 
rrrrg + 2 


vv 
rrrrg + 3 


Data 
IRelative to the 


Memory 
SS Register) 


mm + 2 
uuuus - 
4 


mm 
uuuus - 
3 


nn 
uuuus - 
2 


nn 
uuuus 
- 


uuuus 


Program 
(Relative 
to the 


Memory 
CS Register) 


FF 
ppppm 


1C 
ppppm + 


ppppm + 2 


ppppm + 3 


Store the address of the instruction following the CALL on the top of the Stack. If 
the specified operand is a register, move the contents of the register to the PC register. If 
the specified operand is a memory location, move the contents of the specified memory 
location to the PC register. Continue 
execution 
from this point. 


The encoding for this instruction 
is: 


CALL 
mem/reg 
-.- 
FF 


Addressing mode byte(sl. 
As described earlier in this chapter. 


Consider 
the case where the PC register contains 
FFOOI6, the DS register contains 
010016, the BX register 002616, and the word at memory location 0102616is 024016' After 
the instruction 


has executed, 
the PC register will contain 024016' Execution will continue at this loca- 
tion. 
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Data 
(Relative to the 


T 
S 
Z 
A 
P 
C 
Memory 
DS Register) 


PSW 


yy 
rrrrg 


AX 
xx 
rrrrg + 1 


BX 
gg 
gg 


CX 


DX 


Data 
(Relative to the 


Memory 
5S Register) 


SP 
ss 
ss 
mm + 2 
uuuus - 
2 


BP 
mm 
uuuus 
- 


SI 
uuuus 


DI 


PC 
mm 
mm 


Program 
(Relative 
to the 
CS 
nn 
nn 
Memory 
CS Register) 


DS 
hh 
hh 
FF 
ppppm 


SS 
tt 
tt 
17 
ppppm + 


ES 
Stack Data Memory 
Address Calculation 
ppppm + 2 


ppppm + 3 


CALL 
[BX] 
Intrasegment indirect through memory (as illustrated above) 


Number of cycles: 
21 + EA 


CALL BX 
Intrasegment indirect through register 
Number of cycles: 
16 


1. 
There are four types of CALLs: 


CALL mem/reg: 
this instruction, 
intrasegment 
indirect CALL 


CALL disp: intrasegment 
CALL 


CALL mem: intersegment 
indirect CALL 


CALL addr: intersegment 
CALL 


If the high-order bit of the AL register is I, store FFI6 into the AH register, other- 


wise store 0016 into the AH register. 


The encoding of this instruction 
is: 


CBW-.- 
98 


As an example, if the AL register contains 4F 16' the executing instruction 


0 
D 


PSW 


AX 


ax 


CX 


DX 


SP 


BP 


SI 


DI 


PC 
mm 


CS 
nn 


DS 


SS 


ES 


Data 
Memory 


Program 
(Relative 
to the 


Memory 
CS Register! 


CBW 
Number 
of cycles: 
2 


2. 
The value in the AL register should represent a number between + 127 and - 128, 
i.e., AL should contain a signed 8-bit value. 


3. 
This instruction 
can be used for extending the AL register before a 16-bit IMUL or 


IDlY instruction. 


This instruction 
sets the Carry status 
to O. No other statuses 
or registers 
are 
affected. 


The encoding for this instruction 
is 


CLC 
-..- 
F8 


Data 


Memory 


AXI 
8X --------- 
CX ------- 
OX ------- 
Program 
(Relative to the 


Memory 
CS Register! 


CLC 
Number of cycles: 2 


This instruction 
sets the DF flag to O. This has the effect of making the string 
operations 
perform auto-increment 
on the pointers used by the string operations. 
No 
other statuses or registers are affected. 
The encoding for this instruction 
is: 


CLD-.- 
FC 


Data 


Memory 
pswqrrrrIJJ 
~~I I 
I 
Program 
(Relative to the 


Memory 
CS RegISter! 


CLD 
Number of cycles: 2 


Set the Interrupt 
flag to O. This has the effect of disabling all interrupts 
except 
non-maskable 
interrupts, 
which occur on the NMI line. 
The encoding for this instruction 
is: 


CLI 


FA 


Data 


Memory 


Program 
(Relative 
to the 


Memory 
C5 Register! 


5P 


BP 


51 


01 


PC 
mm 
mm 


C5 
nn 
nn 


05 


55 


E5 


CLI 
Number of cycles: 2 


I. 
This instruction 
performs the same function as the 8080 instruction 
OI. 


2. 
Remember 
that when the 8086 acknowledges 
an interrupt 
request, 
the interrupts 


are automatically 
disabled. 


Complement 
the Carry status. No other statuses or registers are affected. 


The encoding for this instruction 
is: 


CMC 
-.- 
F5 


For example, 
if the Carry status were 0 and the instruction 


CMC 


were executed, 
the Carry status would be set to 1. 


Data 


Memory 


AXI 
BX _------01------- 
ex _______ 
01- 
_ 


ox _______ 
01- 
_ 
Program 
(Relative to the 


Memory 
es Register) 


CMC 
Number of cycles: 2 


This instruction is used to compare immediate data present in the succeeding pro- 
gram memory byte(s) with the AL register (8-bit operation) 
or the AX register 06-bit 
operation). 
The comparison 
is performed 
by subtracting 
the data in the immediate 
byte (s) from the specified register and using the result to set the flags. The result of this 
operation is not stored in the specified register, thus no registers are affected, only the 
statuses. 


The encoding for this instruction 
is: 


I 001 
1 1 1 0 W 


j 


l LP__ 
c==e 


ii 


The high-order a bits of the immediate 
L 
operand. This byte is only present if w = 1. 


The low-order a bits of the immediate 
operand. This byte is always present. 


w = 0 
a-bit operation 
w = 1 
16-bit operation 


Consider the case where the AL register contains 2016' After the instruction 


CMP 
AL.ODH 


has executed, 
the AL register will still contain 2016, but the statuses will be modified as 
follows: 


2016 = 0010 
0000 


Two's Comp. of OD16 = 1111 
0011E 


O~001T001 
L3 one bits. set P to 0 


~ 
No Carry, set AF to 0 


Carries out of both bit 6 and bit 7, 
set Overflow to 0 


Set sign to 0 
L-------Carry 
out of high-order bit is 1, 


set Carry to 0 


Nonzero result, set to 0 


AX 


BX 


CX 


DX 


SP 


BP 


SI 


DI 


PC 
mm 
mm 


CS 
nn 
nn 


DS 


SS 


ES 


CMP 
AL,kk 
Number of cycles: 
4 


Data 


Memory 


Program 
(Relative 
to the 


Memory 
CS Register! 


Notes: 


1. 
If you desire to compare 
immediate 
data with any of the other general purpose 
registers or with the contents of some memory location, consult the CMP mem/reg, 
data instruction. 


2. 
This instruction 
performs the same function as the 8080 instruction 
CPI data. In 
addition, this instruction 
allows for 16-bit comparisons. 


This instruction 
compares the immediate data present in the succeeding program 


memory byte(s) with the specified register or memory location. The comparison 
is per- 


formed by subtracting the data in the immediate bytes from the specified memory loca- 
tion or register, and using the result to set the flags. The result of this operation is not 
stored in the specified register or memory location, thus no registers or memory loca- 
tions are affected, only the statuses. An 8-bit or 16-bit operation 
may be specified. 


The encoding for this instruction 
is: 


1100000swl 


CMP 
mem/reg.data 


Imodlllr/mi Lk ~ 


High-order 
byte 
of 
the 
Immediate 


operand.This byte ISonly present If 


_ 
s=Oandw 
= 1 


------Low-order 
byte 
of 
the 
immediate 


operand.This byte is always present 


'----------------Addressing 
mode byte(sl as described 


earlier in this chapter 


L------- 
w = 0 
a-bit operation 


w = 1 
16-bit operation 


I--------------------s 
is the sign extensionbit. If w = 0, this 


bit is ignored.If w = 1 then s = 0; all 16 
bits of the immediate operand are pre- 


sent 


s = 1, only the low-order a bits of the 
immediate operand are present. The 
high-order a bits of the 16-bit operand 
are formed by sign extending the high- 
order bit of kk 


Suppose that the Sl register contains OlBAI6, After t~e instruction 


CMP 
51,0200H 


has executed, 
the SI register will still contain OlBAI6, but the statuses will be modified 
as follows: 


01aA16 = 0000 
0001 
1011 
1010 
Two's Comp. of 0200,6 
= 1111 
1110 
1011 
1010 


111' 
1111 
1011~1010 


I' 


t 
t 
5 one bits, set P to 0 
No Carry from bit 3, set AF to 1 


------------ 
No Carry out of bit 15 or bit 14. 


_ 
set Overflow to 0 


------------5et5ignto 
1 


No Carry out of high-order bit, 
set Carry to 1 


Non-zero result, set Z to 0 


-"-0 
D 


PSW 
~~I 


SP 


BP 


SI 


DI 


PC 


CS 


DS 


SS 


ES 


Data 
Memory 


Program 
(Relative 
to the 


Memory 
CS Register) 


CMP 
Number 
of cycles: 
Register operand: 
4 
Memory 
operand: 
10 + EA 


Notes: 


1. 
This instruction 
is not typically used to CMP immediate 
data with the AX or AL 
register. The instruction 
CMP ac,data is provided for this purpose. 


CMP 
mem/reg" 
mem/reg2 


Compare: 
Register with Register 
Register with Memory 
Memory with Register 


Compare the data in the register or memory operand specified by mem/reg2 with 


the data in the register or memory operand specified by mem/reg1. The comparison 
is 


performed 
by subtracting 
the data specified by mem/reg2 
from the data specified by 
mem/regl 
and using the result to set the flags. Neither 
mem/reg[ 
nor mem/reg2 
is 
affected by this operation. 
An 8- or 16-bit operation 
may be specified. 


The encoding for this instruction 
is: 


Addressing mode byte(s) as described 
earlier in this chapter 


w = 0 
B-bit operation 
w = 1 
16-bit operation 


d is the direction flag. If d = 0, then the 
operand described by mod and rim 
is 
mem/reg 1 and the operand described 
by reg is mem/reg2. If d = 1. then the 
operand described by mod and rim 
is 
mem/reg2 
and the operand described 
by reg is mem/reg 1 


Suppose that the DH register contains 05[6 and the CL register contains 0616,After the 
instruction 


has executed, neither the CL nor DH register will be affected; however, the flags will be 
set as follows: 


0616 = 0000 
0110 
Two's Compoof 0516 = 1111 
1011 


0000 
0001§~ 


L1 
onebit,setPtoO 


~SetAFtoO 


. Set Overflow to 0 


Set Sign to 0 


Carry out of high-order bit is completed, 
set Carry to 0 


Non-zero result, set Z to 0 


-'- 
0 
D 
T 


PSW 


AX 


BX 


CX 


DX 
yy 


SP 


BP 


SI 


DI 


PC 
mm 
mm 


CS 
nn 
nn 


DS 


SS 


ES 


CMP 
CL,DH 
Number of cycles: 
Register with register: 
3 
Memory with register: 
9 + EA 
Register with memory: 
16 + EA 


Data 


Memory 


Program 
(Relative 
to the 


Memory 
CS Register) 


Compare the contents 
of the memory location addressed 
by the SI register with 
the contents 
of the memory location addressed 
by the DI register. The comparison 
is 
performed 
by subtracting 
the contents 
of the memory 
location addressed 
by the DI 
register from the contents of the memory location addressed by the SI register and using 
the result to set the flags. Neither of the memory locations used in the subtraction 
is 
affected. The SI and DI registers are incremented/decremented 
depending on the value 
of the DF flag. An 8- or 16-bit operation may be specified. 
The encoding for this instruction 
is: 


CMP5 


1'0'00' 
, wi 
Lw=o 
An a-bit comparison. The 51 
and 
01 registers are incre- 
mented by , if OF= O.The 51 
and 01 registers are decre- 
mented by , if OF = , 


A '6-bit 
comparison. The 51 
and 01 registers are incre- 
mented by 2 if OF= O.The 51 
and 01 regisers are decre- 
mented by 2 if OF = , 


Suppose that the DF flag is 1, the DS register contains 060016, the SI register contains 
010816, 
the ES register contains 
006016, 
the DI register contains 
018816, 
the word at 
memory location 0610816 is 454416 and the word at memory location 0078816 is 454416, 
After the instruction 


CMP5 
WORO 


has executed, the SI register will contain 010A16, the DI register will contain 018AI6 and 
the flags will be set as follows: 


4544,6 
= 0'00 
0'0' 
0'00 
0'00 


Two's Compoof 4544,6 
= _'_0_'_' 
_'_0_'_0_'_0_'_'_'_'_0_0 
0000 
0000 
0000~0000 
l~ 


t 
toone 
bits. set P to , 


Carry out of bit 3, set AF to , 


.----------- 
Overflow to 0 


-----------5et 
5ign to 0 


------------Carry 
out of high-order bit. 


set Carry to 0 


Zero result, set Z to , 


P5W 
~~I 


5P 


BP 


51 
gg 


DI 
qq 


PC 
mm 


C5 
nn 


D5 
hh 


55 


E5 
It 


Data 
Memory 


Program 
(Relative 
to the 


Memory 
C5 Register) 


Program 
(Relative 
to the 


Memory 
C5 Register) 


CMP5 WORD 
Number of cycles: 22 for a single occurrence. 


9 + (22 • RI for R repetitions 
when preceded by a REP prefix. 


1. 
The REP prefix and/or the LOCK prefix may be used with this instruction. 
If the 
LOCK prefix and the REP prefix are both used in conjunction with the instruction, 
certain problems may occur. For a discussion of this enervating subject, please con- 
sult Chapter 4. 


2. 
The default segment 
register for the operand addressed 
by SI is OS. The segment 
register may be changed 
using a segment 
override 
prefix. The default segment 
register for the operand addressed 
by 01 is ES. This segment 
register assignment 
may not be overridden. 


3. 
The assembler must have certain information to allow it to determine 
whether an 8- 
bit or 16-bit comparison 
will be performed. 
For a discussion 
of how this is done, 
please consult the end of this chapter. 


4. 
The execution 
time for CMPS with a REP prefix may be illustrated as follows: 


REP 
CMP5 
-'- 
~ 
2 + 
9 + 22(R) 


If R 
10 words, then the execution time is 231 clock cycles. 


If the high-order 
bit of the AX register is 1, store FFFFI6 
into the DX register, 
otherwise store 000016 into the DX register. 
The encoding for this instruction 
is: 


CWD 


Suppose that the AX register contains 800116, 
After the instruction 


CWD 


Data 


Memory 


Program 
(Relative to the 


Memory 
CS Register! 


C51 
D5 ------- 
55 ------- 
ES ------- 


CWD 
Number of cycles: 5 


1. 
No statuses are affected. 


2. 
This instruction is useful when performing divisions. If a 16-bit divisor is to be used, 
it is necessary to have a 32-bit dividend. If the only significant bits are in the AX 
register, this instruction 
extends the sign bit into the DX register to make a 32-bit 
dividend. Note that this technique 
works best for the IDIV instruction. 


Convert 
the contents 
of the AL register into binary coded decimal form. This 
instruction 
should be used only after adding two BCD numbers, 
i.e., look upon ADD 
DAA or ADC DAA as compound, 
decimal arithmetic 
instructions 
which operate on 
BCD source operands to generate BCD answers. 


The algorithm for the conversion 
is: 


1. 
If the AF flag is 1 or the low-order four bits of the AL register are A through 
F, then add 0616 to the AL register and set the AF flag to 1. 


2. 
If the CF flag is 1 or the high-order four bits of the AL register are greater than 
9, then add 6016 to the AL register and set the CF flag to 1. 


The encoding for this instruction 
is: 


DAA 


27 


Suppose the AL register contains 
2816 and the BL register contains 
6816, 
After the 
instructions 


Data 


0 
D 
T 
S 
Z 
A 
P 
C 
Memory 


PSW 


AX 
yy 


BX 


CX 


DX 


Program 
(Relative 
to the 


Memory 
CS Register) 


SP 
27 
ppppm 


BP 
ppppm 
+ 1 


SI 
ppppm 
+ 2 


DI 
ppppm 
+ 3 


PC 
mm 
mm 


CS 
nn 
nn 


DS 
Program Memory 


SS 
Address Calculation 


ES 


DAA 
Number of cycles: 
4 


1. 
This instruction 
is useful for the addition of two packed BCD operands. For adjust- 
ing the subtraction 
of two packed BCD operands, 
consult the DAS instruction. 
For 
adjusting the result of ASCII addition and subtraction, 
consult the AAA and AAS 


instructions. 


This instruction 
converts 
the contents 
of the AL register 
into binary coded 


decimal form. This instruction should only be used after subtracting two BCD numbers, 
i.e., look upon SUB DAS or SBB DAS as compound 
decimal arithmetic 
instructions 
which operate on BCD source operands to generate BCD answers. 


The algorithm for the conversion 
is: 


1. 
If the AF flag is 1 or the low-order four bits of the AL register are between A 
and F, then subtract 0616 from the AL register and set the AF flag to 1. 


2. 
If the CF flag is 1or the high-order four bits of the AL register are greater than 
9, then subtract 6016 from the AL register and set the CF flag to 1. 


The encoding for this instruction 
is: 


DAS-.- 
2F 


Suppose that the AL register contains 8616 and the AH register contains 0716, After the 
sequence 
of instructions 


has executed, 
the AL register will contain 7916, The SUB instruction 
results in the AL 
register containing 7F 16' 


8616 = 1000 
0110 


Two's Camp. of 0716 = 1111 
1001 


0111 
1111 


Since the low-order 
4 bits of the AL register equal F16, 
the first step of the 
algorithm is performed. 
The AF flag is set to 1. 


Data 
0 
D 
T 
S 
Z 
A 
P 
C 
Memory 


PSW 


AX 
xx 


BX 


CX 


DX 


Program 
(Relative 
to the 


Memory 
CS Register) 


SP 
2F 
ppppm 


BP 
ppppm + 1 


SI 
ppppm + 2 


DI 
ppppm + 3 


PC 
mm 
mm 


CS 
nn 
nn 


DS 
Program Memory 


SS 
Address Calculation 


ES 


DAS 
Number of cycles: 
4 


Notes: 


1. 
This is a decimal subtraction 
adjustment 
algorithm for two packed BCD numbers. 


Another operation available for adjustment 
of subtractions 
is the AAS instruction, 


which adjusts the results of subtracting 
ASCII digits. 


Subtract 1 from the contents of the specified register or memory location. An 8- or 
16-bit operation may be specified. 


The encoding for this instruction 
is: 


DEC 
mem/reg 


1111111 
w 


Addressing mode byte (s) as described 
earlier in this chapter. 


w = 0 
8-bit operation 
w = 1 
16-bit operation 


Suppose that the BH register contains 4F 16' After the instruction 


DEC 
BH 


SP 


BP 


SI 


DI 


PC 
mm 
mm 


CS 
nn 
nn 


DS 


SS 


ES 


DEC 
BH 


Number of cycles: 
Register operand: 3 
Memory operand: 
15 + EA 


Data 


Memory 


Program 
(Relative 
to the 


Memory 
CS Register! 


Notes: 


1. 
This instruction 
can perform the same function as the 8080 instruction 
DCR reg. 


Note that due to the various addressing modes available and the 8-/16-bit option, 
this instruction 
has a good deal more power than the 8080 instruction. 


2. 
Segment registers may not be modified using this instruction. 


3. 
This instruction 
would not normally 
be asked to decrement 
one of the 
16-bit 
registers. The instruction 
DEC reg performs this function and only occupies one 
byte of program memory space. This instruction would be used to decrement 
one of 
the 8-bit registers or memory. 


4. 
This instruction 
does not affect the Carry status. 


Subtract 1 from the contents 
of the specified register. This is a 16-bit decrement 
instruction. 


The encoding for this instruction 
is: 


01001r" 
I 
-.-L3 bits which specify the 16-bit 
register to be decremented 


rrr = 000 for AX 
001 for CX 
010 for DX 
all 
for BX 
100 for SP 
101 for BP 
110 for SI 
111 for DI 


As an example, examine the case where the ex register contains 020016, Executing a 


DEC 
CX 


Data 


0 
D 
T 
S 
Z 
A 
P 
C 
Memory 


PSW 
~~I 


xx 
yy 
~"-0 


Program 
(Relative 
to the 


Memory 
CS Register) 


SP 
49 
ppppm 


BP 
ppppm + 


SI 
ppppm + 2 


DI 
ppppm + 3 


PC 
mm 
mm 


CS 
nn 
nn 


DS 
Program Memory 


SS 
Address Calculation 


ES 


DEC 
CX 


Number of cycles: 
2 


1. 
This instruction 
performs the same function as the 8080 instruction 
DCX reg. 


2. 
Segment registers may not be decremented 
using this instruction. 


DIV 
mem/reg 


Divide AX or DX:AX Registers by Register or Memory Location 


Divide the AX 06-bit 
operation) 
or DX:AX (32-bit operation) 
register 
by the 
contents 
of the specified register or memory 
location, 
considering 
both operands 
as 
unsigned 
binary numbers. 
If a 16-bit operation 
is performed, 
the 8-bit quotient 
is 
returned 
in the AL register, the 8-bit remainder 
is returned 
in the AH register. If the 
quotient to be returned to the AL register is greater than FF 16' then a type 0 (division by 
zero) interrupt 
is generated. 
If a 32-bit operation 
is performed, 
the 16-bit quotient 
is 
returned 
to the AX register, the 16-bit remainder 
is returned 
to the OX register. If the 
quotient to be returned to the AH register is greater than FFFFI6, 
then a type 0 (division 
by zero) interrupt 
is generated. 
A division by zero interrupt 
results in the following actions: 


I. 
Push the Flags register onto the stack. 


2. 
Clear the IF and TF flags. 


3. 
Push the CS register onto the stack. 


4. 
Load the word at memory location 0000216 into the CS register. 


5. 
Push the PC onto the stack. 


6. 
Load the word at memory location 0000016 into the PC register. 


The encoding for this instruction 
is: 


DIV 
mem/reg 


1111011 
wi 


1 


Addressing mode byte (s) as described 
earlier in this chapter. 


-------w 
= 0 16-bit operation 
w = 1 32-bit operation 


As an example, 
consider 
the case where the AX register contains 
OFOSI6, the OX 
register contains 068A 16' and the CX register contains 08E916. After the instruction 


has executed, the AX register will contain the quotient BBEI16, 
and the OX register will 
contain the 073C16• The values of the OF, SF, ZF, AF, PF, and CF flags are undeter- 
mined for this operation, i.e., you have no idea what the value of a particular flag will be 
following DIV. 


Data 
0 
D 
T 
S 
Z 
A 
P 
C 
Memory 


PSW 
~~I 


. xx 
yy 


~ 


* :> 
gg 
hh 


ww 
vv 
Program 
(Relative 
to the 


Memory 
CS Register) 


SP 
F7 
ppppm 


BP 
F1 
ppppm + 1 


SI 
ppppm + 2 


DI 
ppppm + 3 


PC 
mm 
mm 


CS 
nn 
nn 


DS 
Program Memory 


SS 
Address Calculation 


ES 


*' wwvvxxyy 
is divided by gghh. Quotient is returned to AX. 
Remainder is returned to DX 


DIV 
CX 
Number of cycles: 32-bit 
memory divide: 
(150-16BI 
+ EA 
16-bit menlOry divide: 
(B6-96) + EA 
32-bit register: 
144-162 
16-bit register: 
BO-90 


1. 
The values for all of the arithmetic flags are undetermined 
after this instruction 
has 
executed. 


2. 
If it is necessary to determine 
whether the DIY instruction 
will result in a division 
by 0 interrupt 
prior to the execution of the DIY instruction, 
the following instruc- 
tion sequences 
will prove helpful. 


16-bit division: 
Assume that CL contains the divisor. 


This sort of check would be useful if the divide by zero interrupt handler was not 
sufficient for your purposes. 


This instruction 
places the contents of the specified memory location on the data 
bus. Essentially this instruction 
performs no operation as far as the 8086 is concerned. 
This instruction 
is used to allow other processors to make use of 8086 addressing modes 
and to receive their instructions 
from the 8086 instruction 
stream. 


The encoding for this instruction 
is: 


Ese 


I 
1 1 01 
1 x x x I 


Addressing 
mode 
information 
as 


described earlier in this chapter 


x is a don't care bit, i.e.,this may be a 0 
or a 1. Note that these don't care bits 
result 
in 
all 
instructions 
with 
initial 
opcode between DB16 and DF16 being 
considered ESe instructions 


Suppose that the BX register contains 063A16, 
the SI register contains 000316, 
the OS 
register contains FF8016, 
and the word at memory location FFE3DI6 
is C30816. 
When 
the instruction 


executes, 
at the time when the READY line is asserted 
by the addressed 
memory 
device, C30816 will be present on the data lines. 


3-108 
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Data 


0 
D 
T 
5 
Z 
A 
P 
C 
Memory 


P5W 
yy 


xx 


AX 


BX 
gg 
gg 


CX 


DX 


Program 
(Relative 
to the 


Memory 
C5 Register! 


5P 
D8 
ppppm 


BP 
00 
ppppm + 1 


51 
hh 
hh 
ppppm + 2 


DI 
ppppm + 3 


PC 
mm 
mm 


C5 
nn 
nn 
Program Memory 
Address Calculation 
D5 
qq 
qq 


55 


E5 


E5C 
Number of cycles: 8 + EA 


Notes: 


1. 
If mod = 11 (i.e., a register is addressed), 
this instruction 
performs no operation. 


CLOCK CYCLES = 2. 


When the HLT instruction 
is executed, 
program execution ceases. It requires an 
external interrupt 
or a reset to restart execution. 
No registers or statuses are affected. 


CAUTION: 
If interrupts are not enabled by an STI instruction prior to the HLT instruc- 
tion, the 8086 CPU cannot exit the Halt state except by activation of the 
hardware Reset or nonmaskable 
interrupt. 


The encoding for this instruction 
is: 


HLT 


0 
0 


psw 
~~I 


SP 


BP 


SI 


01 


PC 
mm 


CS 
nn 


os 


SS 


ES 


Data 


Memory 


Program 
(Relative to the 


Memory 
CS RegIster) 


HLT 
Number of cycles: 2 


Divide the AX 06-bit 
operation) 
or DX:AX C32-bit operation) 
register by the 
contents 
of the specified register or memory 
location, 
considering 
both operands 
as 
signed binary numbers. 
If a 16-bit operation is performed, 
the 8-bit quotient is returned 
in the AL register; the 8-bit remainder 
is returned 
in the AH register. If the quotient to 
be returned 
is greater than 7F16, then a type 0 (division by zero) interrupt is generated. 


If a 32-bit operation is performed, 
the 16-bit quotient is returned to the AX register, the 
16-bit remainder is returned to the DX register. If the quotient to be returned to the AX 
register is greater than 7FFFF 16' then a type 0 (division by zero) interrupt is generated. 
A division by zero interrupt 
results in the following actions: 


1. 
Push the Flags register onto the stack. 
2. 
Clear the IF and TF flags. 
3. 
Push the CS register onto the stack. 
4. 
Load the word at memory location 0000216 into the CS register. 


5. 
Push the PC onto the stack. 


6. 
Load the word at memory location 0000016 into the PC register. 


The encoding for this instruction 
is: 


, , , , 0 , , w 


1 


__' 
Addressing mode byte(sl as described 
earlier in this chapter. 


w = 0 
, 6-bit operation 
w =, 
32-bit operation 


Suppose that the CL register contains ODI6 and the AX register contains 00A916. 
After 
the instruction 


0 
0 


PSW 


AX 
xx 


BX 


CX 


OX 


SP 


BP 


SI 


01 


PC 
mm 


CS 
nn 


OS 


SS 


ES 


Data 


Memory 


Program 
{Relative 
to the 


Memory 
CS Reglsterl 


·Return quotient to the AL register. 
Return remainder to AH register 


IDIV 
CL 


Number of cycles: 
16-bit memory division: 
1107-1181 + EA 
32-bit memory division: 
1171-1901 + EA 
16-bit register division: 
101-112 


32-bit register division: 
165-184 


1. 
This is the signed number division instruction. 
Both operands are treated as signed 
binary numbers 
in the range: 


8-bit operation: 
+127 
to -128 
16-bit operation: 
+32767 
to -32768 


For an unsigned division, consult the DIY instruction. 


2. 
After this instruction 
executes, 
the values of the flags are unknown. 


Multiply the specified register or memory 
location contents 
by the AL (8-bit 
operation) 
or AX 06-bit 
operation) register considering both operands as signed binary 
numbers, 
i.e., perform a signed multiplication. 
If an 8-bit operation is performed, 
the 


low-order 8 bits of the result will be stored in the AL register, the high-order 8 bits of the 
result will be stored in the AH register. If a 16-bit operation is performed, 
the low-order 
16 bits of the result are stored in the AX register, the high-order 
16 bits of the resulCare 
stored in the OX register. In either case, if the high-order 
half of the result is the sign 
extension of the low-order half of the result then the Overflow and Carry flags are set to 
0, otherwise they are set to 1. (For example, 
if an 8-bit operation 
is performed, 
if the 
value returned to the AH register is not 0016 or FFI6, then the Carry and Overflow flags 
will be 1.) 0 status values mean that AH or OX contains significant digits. 


The encoding for this instruction 
is: 


Addressing mode byte (s) as described 
earlier in this chapter. 


---- 
w = 0 
S-bit operation 
w = 1 
16-bit operation 


As an example, 
consider 
the case where the AX register contains 
04E816, 
the DS 
register contains 010016, the BX register contains 000616 and the word at memory loca- 
tion 0100616 is 4E2016• After the instruction 


IMUL 
AX. [SX) 


has executed, 
the AX register will contain 4DOOI6, the OX register will contain 017FI6 
and the Carry and Overflow statuses will be 1. 


8086 Assem bly Language Instruction 
Set 
3-113 


Data 
(Relative to the 
0 
D 
T 
5 
Z 
A 
P 
C 
Memory 
D5 Register) 


P5W 
vv 
rrrrg 


ww 
rrrrg + 1 


AX 
xx 
yy 


BX 
gg 
gg 


CX 


DX 


Program 
(Relative to the 


Memory 
C5 Register) 


5P 
F7 
ppppm 


BP 
2F 
ppppm + 1 


51 
ppppm + 2 


DI 
ppppm + 3 


PC 
mm 
mm 


C5 
nn 
nn 


D5 
hh 
hh 


55 


E5 


IMUL 
AX,IBX] 
Number of cycles: 
B-bit memory multiply: 
(B6 + 104) + EA 
16-bit memory multiply: 
(134 + 160) + EA 


8-bit register multiply: 
80-98 
16-bit register multiply: 
128-154 


Notes: 


1. 
This is the signed number multiply operation. Both operands are treated as numbers 
in the range: 


8-bit operation: +127 
to -128 
16-bit operation: 
+32767 
to -32768 


For an unsigned multiply operation, 
consult the instruction 
MUL. 


2. 
In some cases, it may be more appropriate to use shifts to perform multiplications, 
These cases would occur when memory conservation 
is not of paramount 
impor- 
tance and speed is necessary, 


3. 
After this instruction 
has executed, 
the values of the Sign, Zero, Arithmetic, 
and 


Parity flags are undefined. 


This instruction 
loads 8- or 16-bit data elements 
into the AL (8-bit transfer) 
or 
AX 06-bit 
transfer) 
register from the I/O port identified by the contents 
of the OX 
register. 


The encoding for this instruction 
is: 


IN 
ac.DX 


111011 
Ow I 
lw=o 
B-bit data transfer to AL 


16-bit data transfer to AX 


No other registers (with the exception of AL or AX) or statuses are affected. 
Suppose that the OX register contains 
123416, the I/O buffer at Port 123416 con- 
tains 2316, and I/O buffer at Port 123516 contains F416. Executing an 


IN 
AX,DX 


will load 2316 into the AL register and F416 into the AH register. 


0 
D 


P5W 


AX 


BX 


CX 


DX 


5P 


BP 


51 


DI 


PC 


C5 


D5 


55 


E5 


IN 
AX,DX 
Number of cycles: B 


Data 
Memory 


1. 
This instruction 
allows the user to access input ports which have been assigned 


addresses between 0 and FFFFI6, 


This instruction 
loads 8- or 16-bit data elements 
into the AL (8-bit transfer) 
or 
AX 06-bit 
transfer) 
register from the I/O port identified 
by the second byte of the 
instruction. 


The encoding for this instruction 
is: 


IN 
aC,port 


11110010LW_I. 
t 
I 
yy is 8-bits specifying the 1/0 port 


w=O 


No other registers (with the exception of AL or AX) or statuses are affected. 
Suppose that the I/O buffer at Port 0616 contains 4316, Executing an 


IN AL,06H 


0 
D 


P5W 


AX 


BX 


CX 


DX 


5P 


BP 


51 


DI 


PC 
mm 


C5 
nn 


D5 


55 


E5 


Data 
Memory 


Program 
(Relative 
to the 


Memory 
C5 Register! 


IN 
AL,yy 
Number 
of cycles: 
10 


1. 
This 
instruction 
allows 
the 
user 
to access 
I/O 
ports 
which 
have 
been 
assigned 
addresses 
between 
0 and 
FF 16' To address 
ports 
whose 
addresses 
are outside 
this 
range, 
consult 
the instruction 
IN aC,DX. 


2. 
This 
instruction 
performs 
the same 
function 
as the 
8080 instruction 
IN port. 


Add 1 to the contents of the specified register or memory location. An 8- or 16-bit 
operation 
may be specified. 
The encoding for this instruction 
is: 


INC 
mem/reg 


1111111 
w 


Addressing mode byte (s) as described 
earlier in this chapter 


w = 0 
8-bit operation 
w = 1 
16-bit operation 


Consider 
the case in which the OS register contains 
F80016, 
the contents 
of the BX 


register are 028016, the SI register contains 1E16, and memory location F829EI6 
contains 
6416, After the execution of the instruction 


INC 
[BX + SI) 


location F829E16 
will contain 6516, 


Data 
IRelative to the 
0 
D 
T 
S 
Z 
A 
P 
C 
Memory 
DS Register! 


PSW 
xx 
zzzzz 


ZIZZZ 
+ 1 


AX 


BX 
gg 
gg 


CX 


DX 


Program 
(Relative 
to the 


Memory 
CS Register! 


SP 
FE 
ppppm 


8P 
00 
ppppm + 


SI 
hh 
hh 
ppppm + 2 


DI 
ppppm + 3 


PC 
mm 
mm 


CS 
nn 
nn 


DS 
rr 
rr 
Program Memory 


SS 
Address Calculation 


ES 


INC 
[BX + SI) 


Number of cycles: Memory Operand: 15 + EA 
Register Operand: 3 


Notes: 


1. 
Segment registers may not be incremented 
by this instruction. 


2. 
This instruction 
can perform the same function as the 8080 instruction 
INR reg. 
Note also that this instruction 
has a good deal more power than the 8080 instruc- 
tion. 


3. 
This instruction 
would not normally 
be used to increment 
one of the 
16-bit 
registers. The instruction 
INC reg performs this function and only occupies one byte 
of program memory space. This instruction 
would be used to increment 
one of the 
8-bit registers and memory locations. 


4. 
This instruction 
does not affect the Carry status. 


INC reg 


01000"r 
I 
L 3 
bits 
which 
specify 
which 
16-bit 
register is to be incremented. 


rrr = 000 for AX 
001 for CX 
010 for DX 
011 for BX 
100 for 5P 
101 for BP 
110 for 51 
111 for DI 


Consider 
the case where 
the contents 
of the SI register 
are 00FF16• 
Executing 
an 


INC 
51 


will result 
in the contents 
of the SI register 
being incremented 
to 010016. 


0 
D 


PSW 
~~I 


SP 


BP 


SI 
kk 


DI 


PC 
mm 


CS 
nn 


DS 


SS 


ES 


Data 
Memory 


Program 
(Relative 
to the 


Memory 
CS Register) 


INC SI 
Number 
of cycles: 
2 


Notes: 


1. 
This instruction 
performs the same function as the 8080 instruction 
INX reg. 


2. 
Segment registers may not be incremented 
using this instruction. 


3. 
This instruction 
does not affect the Carry status. 


This 
instruction 
performs 
the following 
sequence 
of operations: 


1. 
Push 
the Flags 
register 
onto 
the stack. 


2. 
Clear 
the IF and TF flags to O. 


3. 
Push 
the CS register 
onto 
the stack. 


4. 
Load 
the word 
at memory 
address 
OOxxx into 
the CS register. 
xxx is deter- 


mined 
by the low-order 
bit of the op-code 
and possibly 
the second 
byte of the 
instruction. 
If the low-order 
bit of the op-code 
is 0, then 
xxx is 00E16. 
If the 
low-order 
bit of the op-code 
is I, then 
xxx is equal 
to 2 plus 4 times 
the second 


byte of the instruction. 
In other 
words, 


IF low-order 
bit = 0 
THEN 
xxx = 00EI6 


ELSE 
xxx = (4*2nd 
byte) + 2 


5. 
Push 
the 
PC register 
onto 
the stack. 


6. 
Load 
the 
word 
at memory 
address 
OOyyy into 
the 
PC register. 
yyy is deter- 


mined 
by the low-order 
bit of the op-code 
and possibly 
the second 
byte of the 
instruction. 
If the low-order 
bit of the op-code 
is 0, then 
yyy is 00C16• 
If the 


low-order 
bit of the op-code 
is I, then 
yyy is equal 
to 4 times 
the second 
byte 
of the instruction. 
In other 
words, 


IF low-order 
bit = 0 
THEN 
yyy = 00CI6 


ELSE yyy = 4 * 2nd byte 


The 
encoding 
for this instruction 
is: 


INT 


I~ 


This byte IS only present If v = 1. Used 
to calculate interrupt vector address 


v = 0 
Interrupt 
vector 
address 
is ooooe 16 


v = 1 
Interrupt vector address is 4· 2nd 
byte 


of the instruction 


0 
0 
T 
S 
Z 
A 
P 
C 


PSW 
~~I 


SP 
ss 
ss 


BP 


SI 


01 


PC 
mm 
mm 


CS 
nn 
nn 


OS 


SS 
tt 
tt 


ES 


Program Memory Address Calculation 


Data Memory Address Calculation 


INT 
Number of cycles: 
52 if v = 0 
51 if v = 1 


Data 
Memory 


yy 
ooOOC 


xx 
00000 


ww 
ooooE 


vv 
ooooF 


00010 


Data 
(Relative 
to the 


Memory 
SS Register! 


mm + 
uuuus 
- 
6 


mm 
uuuus - 
5 


nn 
uuuus 
- 
4 


nn 
uuuus 
- 3 


Flags(LOI 
uuuus - 
2 


Flags(HI) 
uuuus 
- 


uuuus 


Program 
(Relative 
to the 


Memory 
CS Register) 


R 


c 
ppppm 


ppppm 
+ 1 


ppppm 
+ 2 


If the Overflow flag is 0, this instruction 
performs no operation. 
If the Overflow 
flag is I, the following sequence of events occurs: 


I. 
Push the Flags register onto the stack. 


2. 
Set the IF and TF flags to O. 


3. 
Push the CS register onto the stack. 


4. 
Move the word at memory location 0001216 into the CS register. 


5. 
Push the PC register onto the stack. 


6. 
Move the word at memory location 0001016 into the PC register. 


Continue 
execution 
from this point. 


The encoding for this instruction 
is: 


INTO 
-.- 
CE 


Data 
(Relative to the 


Memory 
DS Register! 


yy 
DOO016 


xx 
001116 


ww 
001216 


vv 
001316 


PSW 
~~I 


SP 
ss 
ss 


BP 


SI 


DI 


PC 
mm 
mm 


CS 
nn 
nn 


DS 


SS 
tt 
tt 


ES 


Data 
(Relative to the 


Memory 
SS Register! 


mm + 
uuuus - 
6 


mm 
uuuus 
- 
5 


nn 
uuuus 
- 
4 


nn 
uuuus - 
3 


Flags (LOI 
uuuus 
- 
2 


Flags (HI) 
uuuus 
- 


uuuus 


Program 
(Relative 
to the 


Memory 
CS Register) 


~ 


E 
ppppm 


ppppm + 1 


ppppm + 2 


ppppm + 3 


INTO 
Number of cycles: 
53 if overflow 
set 
4 if not 


Pop the two top stack bytes into the program counter; these two bytes provide the 
offset address for the next instruction 
to be executed. 
Pop the next two stack bytes into 
the CS register; these two bytes provide the code segment address of the next instruc- 
tion to be executed. 
Pop the next two stack bytes into the Flags register. Previous pro- 
gram counter, 
code segment and Flags register contents 
are lost. 


The encoding for this instruction 
is: 


IRET 
-.- 
CF 


Data 
(Relative to the 


Memory 
55 Register) 


yy 
uuuus 


0 
D 
T 
5 
Z 
A 
P 
C 
xx 
uuuus - 
1 


P5W 
ww 
uuuus - 
2 


vv 
uuuus - 
3 
~~I 


rr 
uuuus 
- 
4 


qq 
uuuus - 
5 


Program 
(Relative 
to the 


Memory 
C5 Register! 


5P 
ss 
ss 
CF 
ppppm 


BP 
ppppm + 1 


51 
ppppm + 2 


DI 
ppppm + 3 


PC 
mm 
mm 


C5 
nn 
nn 


D5 


55 
tt 
It 


E5 


IRET 
Number of cycles: 
24 


JA 
JNBE 
disp 
disp 


This instruction 
is identical to the JMP disp instruction 
except that the Jump is 
executed only if the Carry flag and the Zero flag are 0; otherwise the next instruction 
is 
executed. 
The encoding for this instruction 
is: 


JNBE 
disp 
-.- 
77 


disp 
I 
L- This is an a-bit displacement byte. 


as described earlier in this chapter 


C = 0 and Z = 0 
~C = 1 or Z = 1 
r - - - - - - - - -J 
E 
NEXT 
I 
A 
D 
AL.7FH 
I 
- 


I 
I 
L - -NEXT 
XCHG 
BX. [BP + 51 + OF631H] 


after the JNBE instruction, 
the XCHG instruction 
is executed if the Carry flag and the 
Zero flag are O.The AND instruction 
is executed if the Carry flag or the Zero flag is 1. 


Number of cycles: 
Jump is performed: 
16 
Jump is not performed: 
4 


JAE 
JNB 
disp 
disp 


This instruction 
is identical to the JMP disp instruction 
except that the Jump is 
executed only if the Carry flag is 0, otherwise the next instruction 
is executed. 


The encoding for this instruction 
is: 


JNB 
disp 
-.- 
73 


~ 
L- This 
IS an B-blt dIsplacement byte, 
as described earlier in this chapter 


~~~ 
~.I;=1 


I 
~ 
I 
I 
L - 
_ 
NEXT 
XCHG 


NEXT 
AL,7FH 


after the JNB instruction, 
the XCHG instruction 
is executed if the Carry flag is O. The 
AND instruction 
is executed if the Carry flag is 1. 


Number of cycles: Jump is performed: 
16 
Jump is not performed: 
4 


JB 
JNAE 
disp 
disp 


This instruction 
is identical to the JMP instruction 
except 
that the Jump 
is 
executed only if the Carry flag is 1. 


The encoding for this instruction 
is: 


JB 
disp 
-.- 
72cr 


This 
IS an 8-blt displacement byte, as 
described earlier in this chapter 


~:~ 
~:=o 


I 
_ 
I 
I 


L_NEXT 
XCHG 


NEXT 
AL,7FH 


after the 18 instruction, 
the XCHG instruction 
is executed 
if the Carry flag is 1. The 
AN D instruction 
is executed if the Carry status is O. 


Number of cycles: 
Jump is performed: 
16 
Jump is not performed: 
4 


JBE 
JNA 
disp 
disp 


This instruction 
is identical to the JMP disp instruction 
except that the Jump is 
executed 
only if either the Carry status or the Zero status is 1; otherwise 
the next 
instruction 
is executed. 


The encoding for this instruction 
is: 


disp 
I 
LThiS 
is an a-bit 
displacement 
byte. 


as described earlier in this chapter 


~ 


c = 0 and Z = 0 
C = 1 or Z = 1 
r --- -- --J E 
NEXT 
I 
A 
D 
AL,7FH 
I 
- 
I 
I 
L _ 
NEXT 
XCHG 
BX. IBP + 51 + OF631 HI 


after the JBE instruction, 
the XCHG instruction 
will be executed if the Carry status or 
the Zero status is 1. If both the Carry and Zero statuses are 0, then the AND instruction 
will be executed. 


Number of cycles: Jump is performed: 
16 
Jump is not performed: 
4 


This instruction 
is identical to the JMP disp instruction 
except that the Jump is 
executed only if the CX register is 0; otherwise the next instruction 
is executed. 


The encoding for this instruction 
is: 


~ 
disp 


E3 


~ 
L 
This 
IS an a-bit 
displacement 
byte. 


as described earlier in this chapter 


~X_=_O 
--Jt~: 
.,rc 0 NEXT 


I 
A 
D 
AL,7FH 


I 
I 
I 
I 
L- _ 
NEXT 
XCHG 
BX, [BP + 51 + OF631 Hl 


after the JCXZ instruction, 
the XCHG instruction 
is executed 
if the CX register is 0, 


The AND instruction 
is executed if the CX register is not O. 
Note that this instruction does not reference the Zero flag to determine 
if CX is 0; 


the CX register is referenced directly. 


Number of cycles: 
Jump is performed: 
1a 
Jump is not performed: 
6 


JE 
disp 
JZ 
disp 


Jump 
if Zero/Jump 
if Equal 


This instruction 
is identical to the JMP disp instruction 
except that the Jump is 
executed only if the Zero status equals I; otherwise the next instruction 
is executed. 


The encoding for this instruction 
is: 


JZ 
disp 
-..- 
74 


~ L 
This is an S-bit displacement byte. 
as described earlier in this chapter 


f~~-----~ro 


I 
- 
I 
I 
L-NEXT 
XCHG 


NEXT 
AL,7FH 


after the JZ instruction, 
the XCHG instruction 
is executed if the Zero status equals 1. 


The AND instruction 
is executed if the Zero status equals O. 


Number of cycles: Jump is performed: 
16 
Jump is not performed: 
4 


L 


JG 
JNLE 
disp 
disp 


This instruction 
is identical to the JMP disp instruction 
except that the Jump is 
executed only if the Zero flag is 0 and the Sign flag equals the Overflow flag; otherwise 
the next instruction 
is executed. 
The encoding for this instruction 
is: 


JG 
disp 
-.- 
7F 


~ 
L 
This 
IS an a-bit displacement 
byte, 


as described earlier in this chapter 


~ 


Z=lors=o 


S = 0 and Z = 0 
r - - - - - - - 
NEXT 


I 
A 
0 
AL,7FH 


I 
I 
I 
L -NEXT 
XCHG 
BX, [BP + 51 + OF631H) 


after the JO instruction, 
the XCHO instruction 
will be executed if the Zero status is 0 
and the Sign status equals the Overflow status, If the Zero status is 1 or the Sign status 
does not equal the Overflow status, then the AND instruction 
is executed. 


Number of cycles: Jump is performed: 
16 
Jump is not performed: 
4 


JGE 
JNL 
disp 
disp 


This instruction 
is identical to the JMP disp instruction 
except that the Jump is 
executed 
only if the Sign status is equal to the Overflow status; 
otherwise 
the next 
instruction 
is executed. 


The encoding for this instruction 
is: 


JNL 
disp 
-.- 
70 


~ 
LThiS 
is' an 
8-bit 
displacement. 
as 
described earlier in this chapter. 


5=0 
~5"'0 
i-------~~~ 


I 
- 
I 
I 
L _NEXT 
XCHG 


NEXT 
AL,7FH 


after the JNL instruction, 
the XCHG instruction is executed if the Sign status is equal to 
the Overflow status. The AND instruction will be executed if the Sign status is not equal 
to the Overflow status. 


Number of cycles: Jump is performed: 
16 
Jump is not performed: 
4 


JL 
JNGE 
disp 
disp 


This instruction 
is identical to the JMP disp instruction 
except that the Jump is 
executed 
only if the Sign flag is not equal to the Overflow flag; otherwise 
the next 
instruction 
is executed. 
The encoding for this instruction 
is: 


JL 
disp 


'7C 


~ LThiS 
is 
an 
8-bit 
displacement. 
as 
described earlier in this chapter. 


~ 
~~ 
~: 
= 0 


I 
- 
I 
I 
L _ 
NEXT 
XCHG 


NEXT 
AL.7FH 


after the JL instruction has executed, the XCHG instruction 
will be executed if the Sign 
status is not equal to the Overflow status. The AND instruction 
is executed if the Sign 
status and the Overflow status are equal. 


Number of cycles: Jump is performed: 
16 
Jump is not performed: 
4 


JLE 
JNG 
disp 
disp 


This instruction 
is identical to the JMP disp instruction 
except that the Jump is 
executed 
only if the Zero flag is set or the Sign flag is not equal to the Overflow flag; 
otherwise the next instruction 
is executed. 
The encoding for this instruction 
is: 


JLE 
disp 
-.- 
7E 


~ 
LThiS 
is an B-bit displacement byte. as 
described earlier in this chapter 


Z = 1 or 
~Z 
= 0 and 5 = 0 
5,,<0 
r--- - --- 
NEXT 
I 
A 
D 
AL.7FH 
I 
- 
I 
I 
L _ 
NEXT 
XCHG 
BX. IBP + Sl + OF631H) 


after the JL instruction, 
the XCHG instruction 
is executed if the Zero flag is 1 or if the 
Sign flag is not equal to the Overflow flag. The AND instruction 
is executed if the Zero 
status is 0 and the Sign status equals the Overflow status. 


Number of cycles: Jump is performed: 
16 
Jump is not performed: 
4 


Move the contents 
of the next two program memory bytes into the PC register. 


Move the contents of the succeeding two program memory bytes (bytes 4 and 5 of the 
instruction) 
into the CS register. Continue execution from this point. The previous pro- 
gram counter and Code Segment register contents 
are lost. 


The encoding for this instruction 
is: 


JMP 
addr 
-..- 
EA 
cr=High-order 8 bits of the new segment 


address. This byte is placed in the high- 
order byte of the CS register. 


Low-order 
byte of 
the 
new segment 


address. This byte is stored 
into the 


low-order byte of the CS register. 


High-order 
8 bits 
of 
the 
new 
offset 


address. This byte is moved in the· high- 
order byte of the program counter. 


Low-order 
8 bits 
of 
the 
new 
offset 


address. This byte is moved into the 
low-order byte of the program counter. 


Data 


Memory 


Program 
(Relative to the 


Memory 
CS Register) 


JMP addr 
Number of cycles: 15 


This instruction 
adds the contents 
of the second object code byte (taken as a 
signed 8-bit displacement) 
to the contents of the program counter plus 2; this becomes 
the offset address of the next instruction to be executed. Previous program counter con- 
tents are lost. The Code Segment register contents 
are unchanged. 


The encoding for this instruction 
is: 


JMP 
disp 
-.- 
EBT 
a-bit 
displacement byte, 
as described earlier in this chapter 


In the following instruction 
sequence 


JMP 
NEXT 
AND 
AL,7FH 


after the JMP instruction, 
the XOR instruction 
will be executed. The AND instruction 
will never 
be executed 
unless 
a Jump 
or Call instruction 
somewhere 
else in the 
sequence 
branches to this instruction. 


Data 
a 
D 
T 
5 
Z 
A 
P 
C 
Memory 


P5W 
~~I 
Program 
(Relative 
to the 


Memory 
C5 Register) 


5P 
EB 
ppppm 


BP 
kk 
ppppm + 1 


51 
ppppm + 2 


DI 
ppppm + 3 


PC 
mm 
mm 


C5 
nn 
nn 


D5 


55 


Program Memory 
Address Calculation 


E5 


JMP 
kk 
Number of cycles: 
15 clocks 


1. 
Tris 
instruction 
uses Program 
Relative addressing, 
which is similar to Program 
Relative Paging as described in An Introduction to Microcomputers: Volume I - 
Basic 
Concepts (Osborne/McGraw-Hill, 
1978). The exception is that the program counter 
contents are incremented 
to point to the next instruction 
before the 8-bit signed dis- 
placement is added. 


Add the contents of the next two program memory bytes, treating them as a I6-bit 
unsigned 
displacement, 
to the program counter. 
Continue 
execution 
from this point. 


The previous program counter contents are lost. 


The encoding for this instruction 
is: 


JMP 
-..- 
E9L 


k 


k 


__~ 
jj 
High-order 
8 bits 01 the 16-bit 
displacement 


Low-order 
8 bits 01 the 16-bit 
displacement 


NEXT 
AL.7FH 


after the JMP instruction 
has executed, 
the STOS instruction 
will be executed. 
The 
AND instruction 
will never be executed unless a CALL or JMP instruction 
somewhere 
else in the instruction 
sequence 
refers to BRICKS as its operand. 


3-142 
The 8086 Book 


Data 


0 
D 
T 
5 
Z 
A 
P 
C 
Memory 


P5W 
~~I 


Program 
(Relative 
to the 


Memory 
C5 Register) 


5P 
E9 
ppppm 


BP 
kk 
ppppm 
+ 1 


51 
jj 
ppppm 
+ 2 


01 
ppppm 
+ 3 


PC 
mm 
mm 


C5 
nn 
nn 
Program 
Memory 


D5 
Address 
Calculation 


55 


E5 


JMP 
jjkk 


Number of cycles: 
15 


Move the word at the specified memory location into the program counter; move 
the succeeding word into the CS register. Continue 
execution from this point. Previous 
program counter and Code Segment register contents 
are lost. 


The encoding for this instruction 
is: 


JMP 
mem 
-.- 
FF 


Addressing mode bytelsl. As described 
earlier in this chapter 


Suppose that the OS register contains 700016, the 01 register contains 040416, the word 
at memory 
location 
7040416 is 100016, and the word at memory 
location 
7040616 is 
7EOOI6. After the instruction 


has executed, 
the program counter will contain 100016and the CS register will contain 
7EOOI6· Instruction 
execution will continue from location 7FOOOI6. 


0 
0 
T 
S 
Z 
A 
P 
C 


PSW 
~~I 


SP 


BP 


Sl 


01 
gg 
gg 


PC 
mm 
mm 


CS 
nn 
nn 


OS 
hh 
hh 


SS 


ES 


JMP 
(011 
Number of cycles: 
24 + EA intersegment 


Data 
(Relative to the 


Memory 
OS Register) 


yy 
rrrrg 


xx 
rrrrg + 


ww 
rrrrg + 2 


vv 
rrrrg + 3 


rrrrg + 4 


Program 
(Relative 
to the 


Memory 
CS Register) 


FF 
ppppm 


25 
ppppm + 


ppppm + 2 


ppppm + 3 


If the specified operand is a register, move the contents 
of the register into the 
program counter. 
If the specified operand is a memory location, move the contents 
of 
the memory 
location into the program counter. 
Continue 
execution 
from this point. 


Previous program counter contents 
are lost. The CS register is unchanged. 
The encoding for this instruction 
is: 


JMP 
mem/reg 
-.- 
FF 


Addressing 
mode byte(s). As described 


earlier in this chapter. 


has executed, 
the PC will contain 14A916 and execution will resume with 14A916 as the 
offset address for the next instruction. 


Data 


0 
D 
T 
S 
Z 
A 
P 
C 
Memory 


PSW 


AX 


BX 
xx 
yy 


CX 


DX 


Program 
(Relative 
to the 


Memory 
CS Register! 


SP 
FF 
ppppm 


BP 
E3 
ppppm + 


Sl 
ppppm + 2 


DI 
ppppm + 3 


PC 
mm 
mm 


CS 
nn 
nn 


DS 
Program Memory 
Address Calculation 


SS 


ES 


Number of cycles: 
JMP BX: 11 through registers 
JMP [BX): 
18 + EA through memory 


Notes: 


1. 
This is an intrasegment 
indirect Jump. 
2. 
No registers or statuses are affected. 


JNE 
JNZ 
disp 
disp 


This instruction 
is identical to the JMP disp instruction 
except that the Jump is 
executed only if the Zero flag is equal to 0; otherwise the next instruction 
is executed. 


The encoding for this instruction 
is: 


JNE 
disp 
-.- 
75cr 


This 
IS an a-bit displacement byte, as 
described earlier in this chapter 
~:~-----;t~ 
= 1 


I 
- 
I 
I 
L. _ 
NEXT 
XCHG 


NEXT 
AL,7FH 


after the JNE instruction, 
the XCHG instruction 
will be executed if the Zero flag is O. 


The AND instruction 
will be executed if the Zero flag is 1. 


Number of cycles: Jump is performed: 
16 
Jump is not performed: 4 


This instruction 
is identical to the JMP disp instruction 
except that the Jump is 


executed only if the Overflow status is 0; otherwise the next instruction 
is executed. 
The encoding for this instruction 
is: 


JNO 
disp 
-.- 
71 


~ 
LThiS 
is an 8-blt displacement byte. as 
described earlier in this chapter 


r:~-----4;~, 


I 
- 
I 
I 
L _ 
NEXT 
XCHG 


NEXT 
AL,7FH 


after the JNO instruction, 
the XCHG instruction is executed if the Overflow status is O. 


The AND instruction 
is executed if the Overflow status is 1. 


Number of cycles: Jump is performed: 
16 
Jump is not performed: 4 


JNP 
JPO 
disp 
disp 


This instruction 
is identical to the JMP disp instruction 
except that the Jump is 
executed only if the Parity flag is 0; otherwise the next instruction 
is executed. 


The encoding for this instruction 
is: 


JNP 
disp 
-.- 
78 


~ 
This 
IS an a-bit displacement 
byte. as 
described earlier in this chapter 


r~~-----4~ 
= 1 


I 
- 


II 
L -NEXT 
XCHG 


after the JNP instruction, 
the XCHG instruction 
is executed if the Parity flag is O.The 
AND instruction 
is executed if the Parity flag is 1. 


Number of cycles: Jump is performed: 
16 
Jump is not performed: 
4 


This instruction 
is identical to the JMP disp instruction 
except that the Jump is 


executed only if the Sign flag is 0; otherwise the next instruction 
is executed. 


The encoding for this instruction 
is: 


JNS 
disp 
-.- 
79 


~ 
LThiS 
is an B-bit displacement 
byte. as 
described earlier in this chapter 


In the following instruction 
sequence 
f=~-----f' 


I 
I 
I 
L-NEXT 
XCHG 


NEXT 
AL.7FH 


after the JNS instruction 
is executed, 
the XCHG instruction 
executes if the Sign status 


is 0; otherwise the AND instruction 
executes. 


Number of cycles: Jump is performed: 
16 
Jump is not performed: 
4 


This instruction 
is identical to the JMP disp instruction 
except that the Jump is 
executed only if the Overflow flag is I; otherwise the next instruction 
is executed. 


The encoding for this instruction 
is: 


JO 
disp 
--- 
70 


~ 


LThiS 
is an 8-bit displacement byte, as 
described earlier in this chapter. 


~:~-----~i: 
= 0 


I 
- 
I 
I 
I- _ 
NEXT 
XCHG 


NEXT 
AL,7FH 


after the JO instruction, 
the XCHG instruction will be executed if the Overflow status is 
1. If the Overflow status is 0, then the AND instruction 
will be executed. 


Number of cycles: Jump is performed: 
16 
Jump is not performed: 4 


JP 
JPE 
disp 
disp 


This instruction 
is identical to the JMP disp instruction 
except that the Jump is 
executed only if the Parity flag is 1; otherwise the next instruction 
is executed. 


The encoding for this instruction 
is: 


JP 
disp 
-.- 
7A 


~ 
This 
IS an 8-blt displacement 
byte. as 


described earlier in this chapter 


f~~-----4: = 0 


I 
- 
I 
I 
L-NEXT 
XCHG 


after the JP instruction, 
the XCHG instruction will be executed if the Parity status is 1. If 
the Parity status is 0, then the AND instruction 
will be executed. 


Number of cycles: Jump is performed: 
16 
Jump is not performed: 
4 


This instruction 
is identical to the JMP disp instruction 
except that the Jump is 
executed only if the Sign status is 1. 


The encoding for this instruction 
is: 


JS 
disp 
-.- 
78 


~ 
l- 
This 
IS an 8-blt displacement 
byte. as 
described earlier in this chapter 


In the following instruction 
sequence 
r'------~:., 


I 
II 
L~ 
NEXT 
XCHG 


NEXT 
AL,7FH 


after the JS instruction, 
the XCHG instruction will be executed if the Sign status is 1. If 
the Sign status is 0, then the AND instruction 
will be executed, 


Number of cycles: Jump is performed: 
16 
Jump is not performed: 
4 


L 


This instruction 
moves the low-order eight bits of the Flags register into the AH 
register. The eight bits that are moved are: 


76543210 


~ 


where X indicates an undetermined 
value. 
The encoding for this instruction 
is: 


LAHF 
-,- 
9F 


As an example, consider the case where the Carry and Parity flags are 1, the Zero, Sign, 
and Arithmetic 
flags are O. Executing an 


Data 


Memory 


AX 


BX 


ex 


DX 


SP 


BP 


SI 


DI 


PC 
mm 
mm 


es 
nn 
nn 


DS 


SS 


ES 


Program 
(Relative 
to the 


Memory 
es Register! 


LAHF 
Number 
of cycles: 
4 


1. 
No statuses are affected. No registers except AH are affected. 


2. 
This instruction 
is used in conjunction 
with PUSH AX to emulate the 8080 instruc- 
tion PUSH PSW. 


8086 Code 


LAHF 
PUSH 
AX 


8080 Code 


PUSH PSW 


Load the contents of the specified memory word into the specified register. Load 
the contents 
of the memory 
word following the specified memory word into the DS 
register. 


This instruction's 
encoding is: 


LOS 
reg.mem 
-.- 
C5 


Mod and rim are used to specify 
the 
memory address. as described earlier in 
this chapter. 


3 
bits 
specifying 
the 
destination 
register. 
rrr = 000 for AX 
001 for CX 
010 for OX 
011 for BX 
100 for SP 
101 for BP 
110 for SI 
111 for 01 


As an example, consider the case where the DS register contains COOOI6,the word at 
memory 
location COOIO,6 contains 
018016 and the word at memory location C001216 


contains 200016, After the instruction 


8086 Assembly Language Instruction 
Set 
3-157 


Data 
(Relative to the 


0 
D 
T 
S 
Z 
A 
P 
C 
Memory 
DS Register) 


PSW 


yy 
rrrrk 
~~I 


xx 
rrrrk + 1 


ww 
rrrrk + 2 


vv 
rrrrk + 3 


rrrrk + 4 


Program 
(Relative 
to the 


Memory 
CS Register) 


SP 
C5 
ppppm 


BP 
36 
ppppm + 


SI 
kk 
ppppm + 2 


DI 
ppppm + 3 


PC 
mm 
mm 


LDS 
SI,[jjkk) 
Number of cycles: 
16 + EA 


1. 
No statuses are affected. 


2. 
If mod is 11, then the operation performed 
by this instruction 
is undefined. 


Load the 16-bit offset address that is used to specify the memory operand into the 
specified register. 


The encoding for this instruction 
is: 


LEA 
reg,mem 
-.- 
80 


Mod and rim are used to specify 
the 
memory addressing option as described 
earlier in this chapter. 


3 
bits 
specifying 
the 
destination 
register. 
rrrr = 000 for AX 
001 for ex 
010 for OX 
011 for BX 
100 for 5P 
101 for BP 
110 for 51 
111 for 01 


Suppose that the DS register contains 280016,the BX register contains 040016and the SI 
register contains 003C16. After the instruction 


LEA 
BX, [BX + 51 + OF62H] 


has executed, 
the BX register will contain 139EI6, 
which is the sum of the contents 
of 
the BX and Sl registers and the specified displacement. 


0 
D 


PSW 


AX 


BX 
gg 


ex 


DX 


SP 


BP 


SI 
hh 


DI 


PC 
mm 


es 
nn 


DS 


SS 


ES 


LEA 
BX, [BX + Sl + jjkk] 
Number 
of cycles: 
2 + EA 


Data 
Memory 


Program 
(Relative 
to the 


Memory 
es Register) 


8D 
ppppm 


98 
ppppm + 1 


1. 
No statuses are affected, 


2. 
If mod is 11, then the operation 
performed 
by this instruction 
is undefined. 


LES 
reg,mem 


Load Register and ES from Memory 


Load the contents 
of the specified memory word into the specified register. Load 
the contents 
of the memory 
word following the specified memory 
word into the ES 
register. 


The encoding for this instruction 
is: 


LES 
reg.mem 
C4 


Mod and rim are used to specify the 
memory address as described in this 
chapter. 


3 
bits 
specifying 
the 
destination 
register. 
rrr = 000 for AX 
001 for CS 
010 for OX 
011 for BX 
100 for SP 
101 for BP 
110 for SI 
111 for 01 


Suppose that the DS register contains 
BOOOI6, the BX register contains 
080A16, 
the 
memory word at location B080AI6 is 05A216 and the memory word at B080CI6 is 400016, 
After the instruction 


LES 
OI.[BX] 


has executed, the DI register will contain 05A216 and the ES register will contain 400016, 


.LES 
DUBX] 
Number of cycles: 
16 + EA 


Notes: 


1. 
No statuses are affected. 


2. 
If mod is 11, then the operation performed 
by this instruction 
is undefined. 


3. 
The register specified in this instruction 
is typically the DI register, since the 
DI register is the register normally associated with the ES register. 


This instruction 
is used to force the 8086 to output the LOCK signal low. The 


. LOCK signal is held low for the duration of the next instruction. 


This instruction 
is considered 
to be Ii prefix instruction, 
i.e., it precedes 
the 
instruction 
for which the LOCK signal is to be asserted. 
The encoding for this instruction 
is: 


LOCK 
-.- 
FO 


AXI 
BX ------------- 
CX ------------- 
ox ------------- 


The LOCK signal is 
low for the 
duration of 
the execution of 
the instruction 
at ppppm + 1 


LOCK 
Number of cycles: 
2 


Data 


Memory 


Program 
(Relative 
to the 


Memory 
CS Register) 


1. 
This prefix may be used to preface any 8086 instruction. 
If, however, this prefix is 


used in conjunction 
with the REP prefix and a string primitive, 
certain problems 


may result. For a discussion of this topic, refer to the next chapter. 


2. 
This prefix is very useful in the implementation 
of test-and-set 
sequences. 


Move from the memory 
location addressed 
by the SI register to the AL (8-bit 
operation) 
or the AX (I6-bit operation) 
register. The SI register is incremented/decre- 
mented depending 
on the value of the OF flag. 
The encoding for this instruction 
is: 


LOos 


101011 
Ow I 
Lw=o 
8 bits are transferred. 


If OF = O. the SI register is 
incremented by 1; otherwise it 
is decremented by 1 


w = 1 
16 bits are transferred. 


If OF = O. the SI register is 
incremented by 2; otherwise it 
is decremented by 2. 


For example, 
suppose 
that the OF flag is 0, the SI register contains 
003516, 
the OS 
register contains 
400816, 
and the byte at memory 
location 4008516 
is OF16. After the 
instruction 


has executed, 
the contents 
of the AL register will be OF16 and the contents 
of the SI 
register will be 003616. 


0 
0 


PSW 


AX 


BX 


ex 


DX 


SP 


BP 


Sl 
gg 


DI 


PC 
mm 


es 
nn 


DS 
hh 


SS 


ES 


Data 
(Relative to the 


Memory 
DS Register! 


Program 
(Relative 
to the 


Memory 
es Register! 


LODS 
BYTE 
Number of cyles: 
12 for single OccurrenCe 
9 + (13 times repetition if preceded by REP prefixl 


Notes: 


1. 
No statuses are affected. 


2. 
The default segment 
register is the DS register. This may be overridden 
by the 
appropriate 
segment override prefix. 


3. 
Typically, the REP prefix is not used with this instruction. 


4. 
As with other 8086 operations, 
some symbol must be given to the assembler 
to 
allow the assembler 
to determine 
whether 
an 8- or 16-bit operation 
will be per- 
formed. This subject will be discussed later in this chapter. 


This instruction 
decrements 
the ex register 
(not affecting the flags) and then 
functions in the same manner as the JMP disp instruction, 
except that if the ex register 
has not been decremented 
to 0, then the Jump is executed; 
otherwise the next instruc- 
tion is executed. 
The encoding for this instruction 
is: 


LOOP 
disp 
-..- 
E2 


~ 
This is an a-bit displacement 
byte. as 
described earlier in this chapter 


} Calculate payroll sum here 


LOOP 
PAYROLL$SUMMATION$ARRAY 


The sequence of instructions 
between PAYROLL$SUMMA TION$ARRA Y and 
the LOOP instruction 
will be executed LENGTH$OF$PA 
YROLL$ARRA Y times. 


Number of cycles: Jump is performed: 
11 
Jump is not performed: 
5 


LOOPZ 
LOOPE 
disp 
disp 


This instruction 
decrements 
the ex register 
(not affecting the flags) and then 
functions in the same manner as the JMP disp instruction, 
except that if the ex register 
has not been decremented 
to 0 and the Zero flag is 1 then the Jump is executed; other- 
wise the next instruction 
is executed. 


The encoding for this instruction 
is: 


LOOPZ 
disp 
'-v-" 


El 
c:E=J 
LThiS 
is an B-bit 
displacement 
byte 
as 


described 
earlier in this chapter. 


MOV 
MOV 
MOV 


IN 
INC 
XCHG 
XCHG 


IN 
INC 
XCHG 
CMP 
LOOPE 
JNZ 


CX.NUMBER$OF$PORTS 
OX,MAIN$PORT$GROUP 
BX.REOUNOANT$PORT$GROUP 


AX,OX 
OX 
BX,OX 
AX,BP 


AX,OX 
OX 
BX,OX 
AX,BP 
TOP 
PORT$OISPUTE 


The sequence of instructions 
between TOP and the LOOPE instruction 
compare 
data 
available 
at 
two 
sets 
of 
input 
ports; 
one 
group 
is 
pointed 
to 
by 
MAIN$PORT$GROUP, 
and 
the 
other 
group 
is 
pointed 
to 
by 
REDUNDANT$PORT$GROUP. 
The 
instruction 
JNZ 
PORT$DISPUTE 
will 
be 
executed after one of two scenarios has occurred: 


I. 
A comparison 
has resulted in the Zero flag being set to 0, in which case the data at 
the ports is not equal. 


2. 
The instructions 
between TOP and LOOPE have executed NUMBER$OF$PORTS 
times. 


Number 
of cycles: 
Jump 
is performed: 
1B 
Jump 
is not performed: 
6 


LOOPNZ 
LOOPNE 
disp 
disp 


This instruction 
decrements 
the CX register 
(not affecting the flags) and then 
functions in the same manner as the JMP disp instruction, 
except that if the CX register 
has not been decremented 
to 0 and the Zero flag is 0, then the Jump is executed; other- 
wise the next instruction 
is executed. 
The encoding for this instruction 
is: 


LOOPNZ 
disp 
------ 
EO 


~ 
L 
This is an a-bit 
displacement 
byte as 
described earlier in this chapter 


As ari example, consider the following sequence of instructions: 


MOV 
SI.ELEMENT$TO$MATCH 
LES 
DI 
MOV 
CX.NUMBER OF ENTRIES 


;SEARCH FOR MATCH 


LOOPNE 
SEARCH$FOR$MATCH 


The code between 
the SEARCH$FOR$MA 
TCH instruction 
and the LOOPNE 
instruction will be executed until 1) CX is decremented 
to 0, or 2) the instruction before 
LOOPNE sets the Zero flag to I; e.g., the Zero status might be I if a match is found. 


Number of cycles: 
Jump is performed: 
19 
Jump is not performed: 
5 


Register to Register 
Memory to Register 
Register to Memory 


This instruction 
is used to move 8- or 16-bit data elements between a register and 


a register or memory location. 


The encoding for this instruction 
is: 


MOV 
SRC,DEST 


1 0 0 
1 0 d w I I 
mod reg rim 
I 


~ 
Addressing mode byte as described 
earlier in this chapter. 


'---------w 
= 0 
8-bit move 
w = 1 
16-bit move 


'----------d 
is the Direction flag. If d = 0, the 


operand described by mod and rim 
is 


memlreg 1 and the operand described 
by reg is mem/reg2' 
If d = 1, then the operand described by 
mod 
and 
rim 
is mem/reg2 
and the 


operand described by reg is memlreg 
1. 


0 
D 


PSW 
~~I 


SP 


BP 


SI 


DI 


PC 


CS 


DS 


SS 


ES 


Data 


Memory 


===_:: ===_:: p 


Program 
(Relative 
to the 


Memory 
CS Register) 


MOV 
AX,eX 


Number of cycles: 


register to register: 
2 
memory to register: 
8 + EA 
register to memory: 
9 + EA 


Notes: 


1. 
Segment registers may not be specified in this instruction. 
To move data to/from 
segment 
registers, consult the MaV segreg,reg or MaV reg,segreg operations. 


2. 
No statuses are affected. 


3. 
This 
instruction 
performs 
the 
function 
that 
the 
MaV 
reg,reg 
instruction 
accomplished 
in the 8080 assembly instruction. 
This instruction 
does, however, 
provide for more in the way of flexibility than the corresponding 
8080 instruction. 


MOV 
reg,data 


Load Immediate 
Data into Register 


This instruction 
is used to load 8- or 16-bit data elements 
into a register 
via 
immediate 
addressing. 
The encoding for this instruction 
is: 


MOV 
reg.data 


1011 
w.:.:-.1r lkk_~_jj . 
L- JJ ISthe high-order 8-blts of a 16-blt Immediate 


operand. This byte is only present if w = 1. 


kk is the low-order 8 bits of the immediate operand. 
This byte is always present. 


rrr is 3 bits which select the register which will be 
destination for the immediate operand. The 
interpretation 
of rrr depends on the value of w. 


8-bit operation 
rrr = 000 for AL 
001 for CL 
010 for DL 
011 for BL 
100 for AH 
101 for CH 
110 for DH 
111 for BH 


16-bit operation 
rrr = 000 for AX 


001 for CX 
010 for DX 
011 for BX 
100 for 5P 
101 for BP 
110 for 51 
111 for DI 


w = 0 
8-bit move. jj is not present 
w = 1 
16-bit move. jj is present 


0 
D 


PSW 


AX 


BX 


ex 


DX 


SP 


BP 


SI 


DI 


PC 
mm 


es 
nn 


DS 


SS 


ES 


Data 


. Memory 


Program 
(Relative to the 


Memory 
es Reglsterl 


MOV 
ex, tikk 
Number of cycles: 4 


Notes: 


1. 
Segment registers may not be loaded via this instruction. 
To load immediate 
data 
into a segment 
register, consult the MOY segreg,mem/reg 
instruction. 


2. 
This instruction 
performs the function that MYI (8-bit move immediate) 
and LXI 


06-bit 
move immediate) 
instructions 
perform for the 8080. 


3. 
No statuses are affected. 


This instruction 
is used to move 8- or 16-bit data elements from a memory loca- 
tion to the accumulator. 


The encoding for this instruction 
is: 


MOV 
ac.mem T,..'" 
"-"'on••••,,~, ••;,•• 


low-order a bits of the address. jj is the 
high-order a bits of the address. 


w = 0 
AL is the destination 
la-bit operation!. 


w = 1 
AX is the destination 


(16-bit operation!. 


moves the contents of memory location 106416 (relative to the DS register) into the AL 
register. 


0 
D 


PSW 


AX 


BX 


CX 


DX 


SP 


BP 


SI 


DI 


PC 
mm 


CS 
nn 


DS 
hh 


SS 


ES 


Data 


Memory 
(Relative to the 
DS Register) 


Program 
(Relative 
to the 


Memory 
CS Register! 


MOV 
AL,[jjkkl 
Number of cycles: 
10 


1. 
This instruction 
performs the same function as the 8080 instruction 
LDA addr. In 
addition, 
this instruction 
allows a 16-bit load to the AX register. 


This instruction 
is used to move 8- or 16-bit data elements from the accumulator 
to a memory location. 
The encoding for this instruction 
is: 
I.".,,".", 
""" 
"',," 
""", 
low-order 8 bits of the address. jj is the 
high-order 8-bits of the address. 


w : 
0 
AL is the source. 


(8-bit operation) 
w : 
1 
AX is the source. 


(16-bit operation 


moves the contents of the AX register into memory location 106416 (relative to the DS 
register). The contents of AL are moved into 106416 and the contents of AH are moved 
into 106516, 


8086 
Assembly 
Language 
Instruction 
Set 
3-175 


Data 
(Relative to the 


0 
D 
T 
S 
Z 
A 
P 
C 
Memory 
DS Register! 


PSW 
yy 
ggggk 


xx 
ggggk + 1 


AX 
xx 
yy 
ggggk + 2 


BX 


CX 


DX 


Program 
(Relative 
to the 


Memory 
CS Register! 


SP 
A3 
ppppm 


BP 
kk 
ppppm + 


SI 
ppppm + 2 


DI 
ppppm + 3 


PC 
mm 
mm 


CS 
nn 
nn 


DS 
hh 
hh 


SS 


ES 


MOV [jjkkl.AX 
Number of cycles: 
10 


Notes: 


1. 
No statuses are affected. 


2. 
This instruction 
performs the same function as the 8080 instruction 
STA addr. In 
addition, this instruction 
allows a 16-bit store of the AX register. 


Move a 16-bit data element from a register or memory location into a segment 


register. 


The encoding for this instruction 
is: 


MOV 
segreg,mem/reg 
-,- 
BE 


mod and rim are used to specify 
the 
memory/register 
operand as described 
earlier in this chapter. 


reg is two bits which specify the 
segment registers 
rr = 00 for E5 
01 for C5 
10 for 55 
11 for D5 


0 
D 


P5W 


AX 


BX 


CX 


DX 


5P 


BP 


51 


DI 


PC 


C5 


D5 


55 


E5 


Data 


Memory 


xx 
yy 


Program 
(Relative 
to the 


Memory 
C5 Register) 


BE 
ppppm 


02 
ppppm + 1 


ppppm + 2 


ppppm + 3 


mm 
nn 


nn 
nn 
Program Memory 
Address Calculation 


MOV 
55,DX 
Number of cycles: register to register: 2 
memory to register: B + EA 


Notes: 


1. 
If reg = 01, then the results of this operation 
are undefined, 
This prohibition 
pre- 


vents the user from storing directly into the CS register. 


2. 
This instruction 
is typically used in initialization sequences where the program seg- 
ment areas are defined. 


Move a 16-bit data element 
from a segment 
register into a register or memory 
location. 
The encoding for this instruction 
is: 


MOV 
mem/reg, segreg 
-,- 
8C 


mod and rim 
are used to specify 
the 
memory/register 
operand as described 
earlier in this chapter 


reg is two bits which specify the 
seg ment register 
rr = 00 for ES 
01 for CS 
10 for SS 
11 for DS 


For example, 
consider the case where the DS register contains 200016, 
Executing the 
instruction 


a 
a 


P5W 
~~I 


5P 


BP 


51 


DI 


PC 
mm 


C5 
nn 


D5 
rr 


55 


E5 


Data 
Memory 


(Relative to the 
D5 Register) 


Program 
(Relative 
to the 


Memory 
C5 Register) 


Number of cycles: 
Register to Register: 
2 
Register to Memory: 
9 + EA 


\'I0tes: 


L 
This is not a general purpose register-to-register 
MOY; this is for moving segment 
registers. For a general purpose register MOY, consult MOY mem/reg\>mem/reg2. 


Move the immediate 
data in the bytes following the op-code to the specified 
register or memory location. 8- or 16-bit data transfer may be specified. 


The encoding for this instruction 
is:c:o 
L 
jj is the high-order 8 bits of the 16-bit 
immediate operand. This byte is only 
present if w = 1. 


kk is the low-order 8 bits of the immedi- 
ate operand. This byte is always pre- 
sent. 


Addressing mode byte(s) as described 
earlier in this chapter. 


w = 0 
8-bit operation 


w = 1 
16-bit operation 


For example, 
consider 
the case where the DS register contains 
DOOOl6and the BX 
register contains 001616, After the instruction 


MOV 
8X,491 FH 


has 
executed, 
memory 
location 
D001616 
will contain 
IF16 
and 
memory 
location 
D001716will contain 4916, 


0 
D 


PSW 


AX 


BX 


ex 


DX 


SP 


BP 


Sl 


DI 


PC 


CS 


DS 


55 


:S 


~otes: 


~. 
No statuses are affected. 


Data 
(Relative to the 


Memory 
DS Register! 


r 
r 


Program 
Memory 


C7 


07 
\... 
kk 
\ 
jj 


(Relative 
to the 
CS Register! 


The segment registers may not be specified in this instruction. 


This instruction 
is not typically used to move immediate data into the registers. The 
instruction 
MOV reg,data is provided for this purpose. 


MOV 
[BXJ.jjkk 
Number of cycles: 
10 + EA 


Move 8 or 16 bits from the memory location pointed to by the SI register to the 
memory location pointed to by the DI register. The SI and DI registers are increment ed/ 
decremented 
depending 
on the value of the DF flag. 


The encoding for this instruction 
is: 


MOVS 


11010010;:J L w = 0 8 bits are transferred. If OF = O. 


the SI and 01 registers are incremented 
by 
1; 
otherwise 
both 
registers 
are 
decremented by 1. 


w = 1 16 bits are transferred. If OF = O. 
the SI and 01 registers are incremented 
by 
2; 
otherwise 
both 
registers 
are 


decremented by 2. 


Consider the case where the DF flag is 0, the DS register contains 100016, the ES register 
contains 178016, the SI register contains 000616, the DI register contains 000616, and the 
word at memory location 1000616 if 8FOBI6. After the instruction 


MOVS 
WORD 


has executed, 
memory location 1780616 will contain 8FOBI6, the SI register will contain 
000816 and the DI register will contain 000816• 


0 
0 
T 
S 
Z 
A 
P 
C 


PSW 
~~I 


SP 


BP 


SI 
gg 
gg 


01 
qq 
qq 


PC 
mm 
mm 


CS 
nn 
nn 


OS 
hh 
hh 


SS 


ES 
tt 
tt 


Data 
Memory 


(Relative to the 
OS Register) 


Data 
Memory 


(Relative to the 
ES Register) 


Program 
(Relative to the 


Memory 
CS Register! 


~ppppm 


HPpppm+l 


• Program M emory Address Calculation 
•• Destination 
Data Memory 
Address Calculation 


••• Source Data Memory 
Address Calculation 


MOVS 
Number of cycles: 
1B for single occurrence 
9 + (17 times repetition when preceded by REP prefixl 


Notes: 


1. 
No statuses are affected. 


2. 
The default segment register for the source operand is the DS register. This segment 
may be overridden 
using segment 
prefixes. The default segment 
register for the 
destination 
operand is the ES register. This segment 
may not be overridden 
using 
segment prefixes. 


3. 
The REP prefix and/or the LOCK prefix may be used with this instruction. 
Using 
the REP and the LOCK prefixes in conjunction 
with this instruction 
may cause 
problems. 
Consult 
the next chapter for a complete 
discussion 
of these potential 
difficulties. 


4. 
This instruction 
is very useful for moving blocks of memory. Consider the following 
sequence of instructions: 


LES 
DI. eURRENT$START$OF$PRINT$BUFFER 
MOV 
SI. PAGE$HEADER$MESSAGE 
MOV 
ex. 
PAGE$HEADER$MESSAGE$LENGTH 
REP 
MOVS 
BYTE 


These instructions 
would move the data from the memory 
location addressed 
by 
PAGE$HEADER$MESSAGE 
to the memory location addressed by the contents of 
CURRENT$ST ART$OF$PRINT$BUFFER. 


5. 
How do you specify whether an 8- or 16-bit transfer is to be performed? 
This will 
depend on your assembler. 
For a discussion of this subject, refer to the end of this 
chapter. 


Multiply the specified register 
or memory 
location contents 
by the AL (8-bit 
operation) 
or AX 06-bit 
operation) 
register, considering 
both operands 
as unsigned 
numbers, 
i.e., a simple binary multiplication. 
If an 8-bit operation is performed, the low- 
order eight bits of the result are stored in the AL register, the high-order eight bits of the 
result are stored in the AH register. Ifa 16-bit operation is performed, 
the low-order 16 
bits of the result are stored in the AX register, the high-order 
16 bits of the result are 
stored in the DX register. In either case, if the high-order half of the result is 0, then the 
OF and CF flags are set to 0; otherwise they are set to 1 to indicate significant digits in 
AX or DX. 


The encoding for this instruction 
is: 


MUL 


1111011 
w 


Addressing 
mode byte (s) as described 
earlier in this chapter 


w = 0 
8-bit operation 
w = 1 
16-bit operation 


As an example, 
consider the case where the AX register contains 451416 and the CL 
register contains 0516, After the instruction 


MUL 
AL.CL 


has executed, 
the AX register will contain 006416 and the Carry and Overflow flags will 
be O. 


0 
0 


PSW 
~~I 


SP 


BP 


SI 


01 


PC 
mm 


CS 
nn 


OS 


SS 


ES 


Data 


Memory 


xx 
~ 


=_-==_ -==_ -==_ -==_ -==_ -==_ -:=_-==_ -==_ -==_y=_y=_-==_ -==_-:I~",0 
Program 
(Relative to the 


Memory 
CS Register! 


MUL 
AL,CL 
Number of cycles: 
8-bit memory multiply: (76-83) + EA 
16-bit memory multiply: (124-139) 
+ EA 
8-bit register multiply: 70-77 
16-bit register multiply: 118-133 


Notes: 


1. 
This is the unsigned 
number 
multiply operation. 
Both operands 
are treated 
as 


unsigned binary numbers 
in the range: 


For a signed multiply operation, 
consult the instruction 
IMUL. 


2. 
In some cases, it may be more appropriate to use shifts to perform multiplications. 
These cases would occur when memory conservation 
is not of paramount 
impor- 
tance and speed is necessary. 


This instruction 
performs a twos complement 
subtraction 
of the specified operand 
from zero. The result is stored in the specified operand. An 8- or 16-bit operand may be 
specified. 


The encoding for this instruction 
is: 


NEG 


1111011 
w 


mem/reg 


~ 
L-Addressing mode byte(s) as described 
earlier in this chapter 


w = 0 
a-bit operation 
w = 1 
16-bit operation 


Suppose that the BX register contains 000616, 
the DS register contains 
180016, and the 
contents of memory location 1800616 are 4716• After the execution of the instruction 


NEG 
[BX) 


the contents of memory location 1800616 will be B916. 


3-188 
The 8086 Book 


Data 
(Relative to the 


0 
D 
T 
5 
Z 
A 
P 
C 
Memory 
DS Register) 


PSW 


xx 
ggggr 


AX 
ggggr + 1 


BX 
rr 
rr 


CX 


DX 


Program 
(Relative 
to the 


Memory 
C5 Register) 


SP 
F6 
ppppm 


BP 
IF 
ppppm 
+ 


51 
ppppm 
+ 2 


DI 
ppppm 
+ 3 


PC 
mm 
mm 


Program 
Memory 


CS 
nn 
nn 
Address 
Calculation 


DS 
It 
It 


SS 


ES 


NEG 
[BX) 


Number 
of cycles: 
Memory 
operand: 
3 
Register operand: 
16 + EA 


1. 
There is no equivalent 
instruction 
in the 8080 assembly language. An equivalent 


8080 sequence for this instruction 
for a 16-bit quantity would be 


MOV 
A.D 
CMA 
MOV 
D.A 
MOV 
A.E 
CMA 
MOV 
E.A 
INX 
0 


No operation 
is performed. 
The encoding for this instruction 
is: 


NOP 
-...- 
gO 


Data 


0 
0 
T 
S 
Z 
A 
P 
C 
Memory 


PSW 
~~I 
Program 
(Relative 
to the 


Memory 
CS Register! 


SP 
gO 
ppppm 


BP 
ppppm + 


SI 
ppppm + 2 


01 
ppppm + 3 


PC 
mm 
mm 


CS 
nn 
nn 
Program Memory 
OS 
Address Calculation 


SS 


ES 


NOP 
Number of cycles: 3 


Complement 
the contents 
of the specified register or memory location. 


The encoding for this instruction 
is: 


NOT 
mem/reg 


11111011.,;,1 
I 


Addressing mode byte(s) as described 


~ 
:r~e~ in t:~sb~h::~:~tion 


w = 1 
16-bit operation 


NOT 
BL 


has executed, the BL register will contain 0416, 


0 
0 


PSW 
~~I 


SP 


BP 


SI 


01 


PC 
mm 


CS 
nn 


OS 


SS 


ES 


_== 
_====_u FC> ~ 


NOT 
BL 
Number of cycles: Memory operand: 
16 + EA 
Register operand: 3 


Data 
Memory 


Program 
(Relative 
to the 


Memory 
CS Register) 


1. 
No statuses are affected. 


2. 
This instruction 
performs 
the same function 
as the 8080 instruction 
CMA. This 
instruction 
also allows 16-bit complements 
and complementing 
of any general pur- 
pose register or memory location. 


OR 
ac,data 


OR Immediate Data with the AX or AL Register 


OR the immediate 
data in the succeeding program memory byte(s) with the AL 
(8-bit operation) 
or AX 06-bit 
operation) 
register. 
The encoding for this instruction 
is: 


OR 
aC,dataL 


k 


k 
_I 
_li 
I 
L The high-order a bits of the immediate 


operand. This byte is only present if w = 1. 


The low-order a bits of the immediate 
operand. This byte is always present. 


w = a a-bit operation. 
AL is operand 
w = 1 16-bit operation 
AX is operand 


Suppose that the AX register contains 060916, After the instruction 


OR 
AX,3030H 


has executed, the AX register will contain 363916, 


060916 
= 0000 
0110 
0000 
1001 


303016 
= 0011 
0000 
0011 
0000 


0011 
0110 
0011 
1001 


1 
L4 one bits, set P to 1 


AF is undefined 
Overflow is cleared to a 


. ------------Sign 
is set to a 
Carry is cleared to a 


Non-zero result, Z is set to a 


1. 
This instruction 
performs the same function as the 8080 instruction 
ORI data. This 
instruction 
also has the ability to perform I6-bit operations. 


2. 
If you desire to OR immediate data with any of the other general purpose registers 
or with some memory location, consult the instruction 
OR mem/reg,data. 


OR 
mem/reg,data 


OR Immediate Data with Register or Memory Location 


OR the immediate 
data in the succeeding 
program 
memory 
byte(s) 
with the 
specified register or memory location. An 8- or 16-bit operation 
may be specified. 


The encoding for this instruction 
is: 


c=z=:J 
l- The high-order byte of the immediate 


operand.This byte is only present if w = 1. 


The low-order byte of the immediate 
operand. This byte is always present. 


Addressing mode byte(s) as described 
earlier in this chapter 


w = 0 
8-bit operation 


w = 1 
16-bit operation 


Consider the case where the DS register contains 380016, the contents of the BX register 
are 020016, 
the DI register contains 013616, and the word at memory location 3833616 is 
06B316• After the instruction 


068316=0000011010110011 
050516=0000 
1000 
0000 
0101 


0000 
1110 
1011 
0111L 
6 one bits. set P to 1 


Carry is cleared to 0 


-----------5ign 
is set to 0 


Overflow is cleared to 0 


Non-zero result, set Z to 0 


8086 Assembly Language Instruction 
Set 
3-195 


Data 
(Relative to the 


0 
D 
T 
S 
Z 
A 
P 
C 
Memory 
DS Register) 


PSW 


yy 
tttt 


AX 
xx 
tttt + 1 


BX 
gg 
gg 


CX 


DX 


Program 
(Relative 
to the 


Memory 
CS Register! 


SP 
81 
ppppm 


BP 
09 
ppppm + 1 


SI 
kk 
ppppm + 2 


DI 
hh 
hh 
ppppm + 3 


PC 
mm 
mm 


CS 
nn 
nn 


DS 
rr 
rr 
Program Memory 


SS 
Address Calculation 


ES 


OR [BX + DII. likk 


Number of cycles: 
Memory operand: 
17 + EA 
Register operand: 4 


1. 
This instruction 
is not typically used to OR immediate 
data with the AX or AL 


register. The instruction 
OR ac,data is provided for this purpose. 


OR 
memreg" 
mem/regz 


OR: 
Register with Register 
Register with Memory 
Memory with Register 


OR the contents 
of the register or memory location specified by mem/reg2 
with 
the contents 
of the register or memory location specified by mem/regl, 
returning 
the 
result to mem/regl. 
An 8- or 16-bit operation 
may be specified. Either mem/regl 
or 
mem/reg2 
may be a memory 
operand, 
but one of the operands 
must be a register 
operand. 
The encoding for this instruction 
is: 


Addressing mode byte(s) as described 
earlier in this chapter 


w = 0 
S-bit operation 
w = 1 
16-bit operation 


d is the Direction flag. If d = O. then the 
operand described by mod and rim 
is 


mem/reg 1 and the operand described 
by reg is mem/reg2' 
If d = 1. then the 


operand described by mod and rim 
is 


mem/reg2 
and the operand described 
by reg is mem/regl' 


Suppose that the AX register contains 006016, the DS register contains 400016, the BX 
register contains 
009A16, 
and the word at memory location 4009AI6 contains 012C16. 


After the instruction 


has executed, 
the contents of the word at memory location 4009AI6 will be 016C16• The 
flags will be set as follows: 


00601 6 = 0000 
0000 
011 0 
0000 


050516=0000 
0001 
0010 
1100 


0000 
0001 
0110~1100 


I 


t 
1 
4 one bits. set P to 1 
AF is undefined 
Overflow is cleared to 0 
----------- 
Sign is set to 0 
Carry is cleared to 0 
Non-zero result. set Z to 0 


Number of cycles: Register to Memory: 
16 + EA 
Memory to Register: 9 + EA 
Register to Register: 3 


Output 8- or 16-bit data elements from the AL (8-bit) or AX (l6-bit) 
register to 
the 1/0 port identified by the contents 
of the DX register. 


The encoding for this instruction 
is: 


OUT 
DX,ae 


11 
1 10111 
7J 
Lw = 0 
8-bit data transfer from AL 


w = 1 
16-bit data transfer from AX 


As an example, consider the case where the DX register contains OFFF2 
16 and the AL 
register 4016, The execution of an 


instruction 
will result in the quantity 4016 being loaded into the I/O buffer at 1/0 port 
number OFFF216, 
. 


0 
D 


PSW 


AX 


BX 


CX 


DX 


SP 


BP 


SI 


DI 


PC 
mm 


CS 
nn 


DS 


SS 


ES 


Data 
Memory 


Program 
(Relative 
to the 


Memory 
CS Register) 


1. 
This instruction 
allows the user to access I/O ports which have been assigned 
addresses between 0 and OFFFFI6• 


2. 
No registers or statuses are affected. 


This instruction 
outputs 
8- or 16-bit data elements 
from the AL (8-bit) or AX 
06-bit) 
register to the I/O port identified by the second byte of the instruction. 


The encoding for this instruction 
is: 


11110011WL_1 
_t 
I 


yy is a bits specifying the 1/0 port 


w = 0 
a-bit data transfer to AL 


w = 1 16-bit data transfer to AX 


No registers or statuses are affected. 
Suppose that the AX register contains 58A416• Executing an 


OUT 
14H.AX 


instruction 
will transfer A416 to the I/O port addressed at 1416 and 5816 to the I/O port 
addressed at 1516, 


1. 
This instruction 
allows the user to access I/O ports which have been assigned 
addresses 
between 0 and FFI6. 
To address ports whose addresses are outside this 
range, consult the instruction 
OUT DX,ac. 


2. 
This instruction 
performs 
the same function 
as the 8080 instruction 
OUT port. 


Additionally, 
this instruction 
allows for 16-bit data transfers in a single instruction 
(not possible using the 8080 instruction 
OUT port). 


3. 
To effectively use the OUT instruction, 
a firm grasp of the hardware configuration 
is 
necessary. The way in which the I/O logic has been implemented 
determines 
the 
port addresses that are used to access various hardware functions. It is also possible 
to design 
a microcomputer 
system 
that accesses 
external 
logic using memory 
reference instructions 
with specific memory addresses. 


Pop the two top stack bytes into the specified memory location or register. This is 
a 16-bit operation. 


The encoding is: 


Addressing 
mode byte(s) as described 
earlier in this chapter 


Suppose that the DS register contains 
FFOOI6, the SI register contains 000816, 
the SP 
register contains OFEAI6, the SS register contains 2FOOI6, and the word stored at loca- 
tion 2FFEAI6 is 3BC516. After the instruction 


pop [51] 


has executed, 
the contents of memory location FF00816 will be C516 and the contents of 
memory location FF00916 will be 3B16• SP will be equal to OFECI6• 


AXI 
ax ~------~------- 
CX --------- ..•...------- 
DX -------_ ..•.._------ 


• Data Memory Address Calculation 


•• Program Memory Address Calculation 


••• Stack Data Memory Address Calculation 


POP 
[SII 
Number of cycles: 
Memory operand: 
17 + EA 
Register operand: 
8 


Data 
Memory 


Data 
Memory 


(Relative to the 
DS Register) 


(Relative to the 
SS Register) 


Program 
(Relative 
to the 


Memory 
CS Register) 


1. 
This instruction 
is not typically used to pop data into a register. The instruction 
POP 


reg performs this function a!1d only occupies one byte of program memory space. 


2. 
No statuses are affected. 


POP 
reg 


Read from the Top of the Stack 


Pop the two top stack bytes into the designated 
16-bit register. 


The encoding is: 


POP 
reg 
I a 1 a 1 1 r r r I 
L3 bits which specify the 16-bit register 
into which the data is to be popped 


rrrr = 000 for AX 
001 for ex 
010 for DX 
all 
for BX 
100 for SP 
101 for BP 
110 for SI 
111 for DI 


POP 
BX 


This instruction 
would pop the byte pointed to by the stack pointer (and stack segment) 
into BL, then increment 
the stack pointer and pop the addressed byte into BH. Finally, 
the stack pointer would again be incremented 
by 1 to point at the new top of stack. This 
is actually done with a single 16-bit transfer in the 8086. 


8086 Assembly Language Instruction 
Set 
3-205 


Data 
(Relative to the 
a 
0 
T 
S 
Z 
A 
P 
C 
Memory 
SS Register) 


P5W 


yy 
uuuus 


AX 
xx 
uuuus 
+ 1 


BX 
xx 
yy 
uuuus + 2 


CX 


OX 


Program 
(Relative 
to the 


Memory 
CS Register) 


5P 
55 
55 
5B 
ppppm 


BP 
ppppm + 1 


51 
ppppm + 2 


01 
ppppm + 3 


PC 
mm 
mm 
Stack Memory 
Address Calculation 


C5 
nn 
nn 


05 


55 
tt 
tt 


E5 


POP 
BX 
Number of cycles: 
B 


Notes: 


1. 
This instruction 
cannot be used to pop data elements into the segment registers. To 
pop data into segment 
registers, consider the instruction 
POP segreg. 


2. 
For this instruction 
to be meaningful, 
it is of course necessary to have: 


a. an initialized stack pointer 


b. data already on the stack via a PUSH instruction. 


Naturally, one could use this instruction for the sole purpose of incrementing 
the SP 
register by 2; however, this is not recommended. 


3. 
This instruction 
performs the same function as the 8080 assembly language instruc- 
tion POP reg. 


This instruction 
pops the two top stack bytes into the designated 
16-bit segment 
register. 


The encoding is: 


POP 
segreg 


000rr111 


••••••L- 2 bits specifying the 16-bit segment 
register into which the data is to be popped 


rr = 00 for ES 
10 for SS 
11 for DS 


for example, will pop the two top stack bytes into the ES register. Undefined operation if 
rr = 01. 


8086 Assembly Language Instruction 
Set 
3-207 


Data 
(Relative to the 
0 
D 
T 
S 
Z 
A 
P 
C 
Memory 
SS Register) 


PSW 


yy 
uuuus 
~~I 


xx 
uuuus + 1 


uuuus + 2 


Program 
(Relative 
to the 


Memory 
CS Register) 


SP 
ss 
ss 
07 
ppppm 


BP 
ppppm 
+ 1 


SI 
ppppm 
+ 2 


01 
ppppm 
+ 3 


PC 
mm 
·mm 


CS 
nn 
nn 


DS 


SS 
11 
11 


ES 


POP 
ES 
Number 
of cycles: 8 


Notes: 


1. 
This instruction 
only pops data into the segment 
registers. To pop data into the 
8086's other registers, consider the instruction 
POP reg. 


2. 
For a more complete description of the function performed by a POP, consult POP 
reg. 


Pop the two top stack bytes into the Flags register. The first byte popped goes into 
the low-order byte of the Flags register. The format of the low-order byte of the Flags 
register is: 


7 
6 
5 
4 
3 
2 
1 
0"'- 
BitNo. 


~ 


The second byte popped is stored into the high-order byte of the Flags register. The for- 
mat for this byte is: 


15 
14 
13 12 
11 
10 
9 
8 ,,--Bit 
No. 


~ 


The encoding for this instruction 
is: 


Consider, 
for example, the situation where the two top bytes on the stack are 4F (top- 
most) and 32. Executing the 


instruction 
will result in the Carry, Parity, Zero, and Interrupt 
flags being set to 1. All 
other flags will be set to O. 


Data 
(Relative to the 


Memory 
55 Register) 


;~I,--:_==-_==-_=:-_== -_==-_= 
Program 
(Relative 
to the 


Memory 
C5 Register) 


Program Memory 
Address Calculation 


POPF 
Number of cycles: 8 


1. 
As with all stack operations, 
it is important 
that the stack pointer be initialized. 
In 
addition, it would be appropriate to have executed a PUSHF instruction, 
in order to 
store the value of the flags, before executing a POPF instruction. 


2. 
This instruction 
performs some of the functions of the 8080 instruction PUSH PSW. 


This instruction 
pushes the contents of the specified register or memory location 
onto the top of the stack. This is a 16-bit push operation. 
The encoding is: 


PUSH 
mem/reg 


FF 


Addressing 
mode bytelsl 
as described earlier in this chapter 


For example, if the DS register contains 280016, the BX register contains 040016, the SP 
register contains 100016, the SS register contains 2FOOI6, and the word stored at memory 
location 2840016 contains A02016, then executing the instruction 


PUSH 
[BX) 


will store AOl6 at memory location 2FFFFI6 
and 2016 at memory location 2FFFEI6. 
The 
SP register will be adjusted to OFFEI6. 


Data 
Memory 


(R elative to the 
SS Register! 


Data 
(Relative to the 


Memory 
DS Register) 


SP 
ss 
ss 
yy 
zzzzh 


BP 
xx 
zzzzh + 1 


SI 
zzzzh + 2 


DI 


PC 
mm 
mm 


Program 
(Relative 
to the 
C5 
nn 
nn 
Memory 
C5 Register! 


D5 
ww 
ww 
FF 
ppppm 


55 
tt 
tt 
37 
ppppm + 


ES 
ppppm + 2 


ppppm + 3 
• Source Data Memory Address Calculation 
•• Program Memory Address Calculation 


••• Stack Data Memory Address Calculation 


PUSH 
[BX] 
Number of cycles: memory 16 + EA 
register 11 


This instruction 
is not typically used to push a register onto the stack. The instruc- 
tion PUSH reg performs 
this function 
and only occupies one byte of program 
memory space. 


No statuses are affected. 


PUSH 
reg 


Write to the Top of the Stack 


This instruction 
pushes the contents of the specified 16-bit register onto the top of 
stack. 


The encoding is: 


PUSH 
reg 


01010rrr -..-L 3 bits which specify the 16-bit register 
to be pushed. 
rrr = 000 for AX 
001 for ex 
010 for DX 
011 for BX 
100 for SP 
101 for BP 
110 for SI 
111 for DI 


As an example, consider the instruction 


PUSH 
SI 


This instruction 
would push the 16-bit contents 
of the SI register onto the stack. This 
function is performed 
as follows: 


I. 
Decrement 
the stack pointer by I. 


2. 
Store the high-order 
8 bits of the specified register into the memory location 
addressed 
by the stack pointer and the stack segment. 


3. 
Decrement 
the stack pointer by 1. 


4. 
Store the low-order 8 bits of the specified register into the memory location 
addressed 
by the stack pointer and the stack segment. 


The stack pointer is left pointing at the last element stored into the stack, com- 
monly referred to as the top of stack. For the 8086 this is actually one 16-bit transfer. 


8086 Assembly Language Instruction 
Set 
3-213 


Data 
(Relative to the 


0 
0 
T 
S 
Z 
A 
P 
C 
Memory 
SS Registerl 


PSW 


yy 
uuuus - 
2 


::1 


xx 
uuuus - 
1 


uuuus 


OX 


Program 
(Relative 
to the 


Memory 
CS Register) 


SP 
ss 
ss 
56 
ppppm 


BP 
ppppm + 1 


SI 
xx 
yy 
ppppm + 2 


01 
ppppm + 3 


PC 
mm 
mm 


CS 
nn 
nn 


OS 


SS 
tt 
tt 


ES 


PUSH 
SI 
Number of cycles: 
10 


Notes: 


1. 
This instruction 
cannot be used to push the segment registers or the Flags register. 


To push segment registers, consult the instruction 
PUSH segreg. To push the Flags 
register, consult the PUSHF instruction. 


2. 
This instruction 
is most effective 
when used after the stack pointer 
has been 


initialized. In fact, the only time this instruction should be used is after the initializa- 
tion of the stack pointer. 


3. 
To retrieve the data from the stack, use the POP instructions. 


4. 
This instruction 
performs the same function as the 8080 instruction 
PUSH reg. 


This instruction 
pushes the contents of the specified 16-bit segment register onto 
the top of stack. 


The encoding is: 


PUSH 
segreg 


000551101 
.....,L-2 bits specifying 
the segment register 
that is to be pushed onto the Stack 


55 = 00 for ES 
10 for SS 
11 for DS 


This instruction 
will push the 16-bit contents 
of the DS register onto the stack. 
Illegal operation 
if ss = 01. 


0 
D 


PSW 
~~I 


SP 


BP 


SI 


DI 


PC 


CS 


DS 


SS 


ES 


Data 
(Relative to the 


Memory 
SS Register) 


yy 
uuuus - 
2 


xx 
uuuus - 
1 


uuuus 


Program 
(Relative 
to the 


Memory 
CS Register) 


ss 
ss 
1E 
ppppm 


ppppm + 1 


ppppm + 2 


ppppm + 3 


mm 
mm 


nn 
nn 


xx 
yy 


tt 
tt 


PUSH 
DS 


Number of cycles: 
10 


Notes: 


1. 
This instruction 
can only be used to push the contents of segment registers onto the 
stack. To push the contents 
of other registers, consult the PUSH reg and PUSHF 
instructions. 


2. 
For a more detailed description 
of the action of the PUSH operation, 
consult the 
PUSH reg instruction. 


3. 
Remember 
that to ensure optimal results, the stack pointer must be initialized. 


This instruction 
pushes the contents of the Flags register onto the top of the stack. 
The format for the Flags register is: 


Bits 15-8 are stored onto the stack first, followed by bits 7-0. 
The encoding for this instruction 
is: 


PUSHF-.- 
9C 


As an example, 
should the Interrupt, 
Sign, and Zero flags be 1, while the Overflow, 
Direction, 
Trap, Arithmetic, 
Parity, and Carry flags are 0, then executing a 


PUSHF 


1. 
Decrement 
the stack pointer. 


2. 
Store the byte XXXXOOI0 into the memory location addressed 
by the stack 
pointer and Stack Segment register. (X refers to an undefined 
value.) 


3. 
Decrement 
the stack pointer. 


4. 
Store the byte 11XOXOXOinto the memory location addressed 
by the stack 
stack pointer and Stack Segment register. For the 8086 this is performed 
as a 
single 16-bit transfer. 


Data 
(Relative to the 


Memory 
55 Register) 


P5W 


yy 
uuuus - 
2 
~~I 


xx 
uuuus - 
1 


uuuus 


Program 
(Relative 
to the 


Memory 
C5 Register! 


5P 
ss 
ss 
9C 
ppppm 


BP 
ppppm + 1 


51 
ppppm + 2 


DI 
ppppm + 3 


PC 
mm 
mm 


C5 
nn 
nn 


D5 
Program Memory 
Address Calculation 


55 
tt 
tt 


E5 


PU5HF 
Number of cycles: 
10 


1. 
Remember 
that, as with all stack instructions, 
this instruction 
works best after the 
stack pointer has been initialized. 


2. 
This instruction 
does not perform the same function as the 8080 instruction 
PUSH 
PSW. The PUSH PSW instruction pushes the contents of the accumulator 
as well as 
the 8080 flags. To emulate PUSH PSW, consult the LAHF instruction. 


Rotate 
the contents 
of the specified 
register 
or memory 
location 
left by the 
specified number of bits through the Carry status. The number of bits to rotate, repre- 
sented by the variable count, is either one or the number contained in the CL register. 
An 8- or 16-bit operand may be specified. 
The encoding for this instruction 
is: 


RCL 


110100cw 


Addressing mode byte(s) as described 
earlier in this chapter 


w = 0 
8-bit 
operand 
w = 1 
16-bit 
operand 


c = 0 
Rotate left one bit 
c = 1 Rotate left the number of bits 


specified by the CL register 


Suppose that the AX register contains 
FB0016 and the Carry status 
is O. After the 
instruction 


RCL 
AX,1 


has executed, the Carry status will be 1 and the AX register will contain F60016• 


Data 
Memory 


BX 


CX 


DX 


SP 


BP 


SI 


DI 


PC 
mm 
mm 


CS 
nn 
nn 


DS 


SS 


ES 


Program 
(Relative 
to the 


Memory 
CS Register) 


DO 
ppppm 
+ 1 


ppppm 
+ 2 


RCL 
AX,1 


Number of cycles: 
Register (1-bit rotate): 
2 
Register (N-bit rotate): 
8 + 14 • N) 
Memory 
11-bit rotatel: 
15 + EA 
Memory (N-bit rotate): 
20 + EA + (4' 
N) 


This instruction 
performs 
the same function 
as the 8080 instruction 
RAL. This 
instruction does, however, allow a great deal more flexibility in that multi-bit rotates 
are allowed, rotations of 16-bit quantities are allowed, and any register or memory 
location may be rotated. 


Note that it is not intuitively obvious whether an 8- or 16-bit rotate is to be per- 
formed, The manner in which this is determined 
depends on your assembler. 
For a 


discussion of this entertaining 
subject, refer to the end of this chapter. 


Rotate 
the contents 
of the specified register or memory 
location right by the 
specified number of bits through the Carry status. The number of bits to rotate, repre- 
sented by the variable count, is either one or the number contained in the CL register. 
An 8- or 16-bit operand may be specified. 
The encoding for this instruction 
is: 


RCR 
mem/reg.count 


110100cw 


Addressing mode byte(s) as described 
earlier in this chapter 


w = 0 
8-bit operand 
w = 1 16-bit operand 


c = 0 
Rotate right one bit 
c = 1 Rotate right the number of bits 
specified by the CL register 


Suppose that the CX register contains F70916 and the Carry status is 1. After the instruc- 
tion 


0 
D 


PSW 


AX 


BX 


CX 


DX 


SP 


BP 


SI 


DI 


PC 


CS 


DS 


SS 


ES 


Data 
Memory 


xx 
yy 


Program 
(Relative to the 


Memory 
CS Register) 


D3 
ppppm 


D9 
ppppm + 1 


ppppm + 2 


ppppm + 3 


mm 
mm 


nn 
nn 
Program Memory 
Address Calculation 


RCR 
CX,CL 


Number of cycles: Register (N-bit rotatel: 
8 + (4 • N) 
Register l1-bit rotatel: 
2 
Memory (N-bit rotatel: 
20 +EA + (4 • N) 
Memory l1-bit rotate): 
15 + EA 


Notes: 


1. 
This instruction 
performs 
the same function 
as the 8080 instruction 
RAR. This 


instruction does, however, allow a great deal more flexibility in that multi-bit rotates 
are allowed, rotations of 16-bit quantities are allowed, and any register or memory 
location may be rotated. 


'2, 
Differentiating 
between an 8- or 16-bit rotation is not obvious when one considers 
this instruction. 
For a discussion of this problem, see the end of this chapter. 


Repeat the following string instruction 
until the CX register 
has been decre- 
mented to zero. All string instructions 
will continue 
to execute until CX is 0 with the 
exception of the SCAS and CMPS instructions, 
which will cease to execute if the value 
of the ZF flag is equal to the low-order bit of this instruction, 
the z bit. 


The encoding for this instruction 
is: 


REP/REPE/REPNE 


~ 
Lz is a don't care bit if the following 
string primitive is: 


MOVS 
LODS 
STOS 
if the folioing string primitive is: 


CMPS 
SCAS 
then if z = 0, then the CMPS or SCAS instruction 
will cease execution if the ZF flag is 0 
z = 1, then the CMPS or SCAS instruction will 
cease execution if the ZF flag is 1 


In the following instruction 
sequence 


MOV 
SI.IOBUF 
LES 
DI,ADDR 
MOV 
CX,COUNT 
REP 
MOVB 


0 
D 


P5W 


;~I 


5P 


BP 


51 


DI 


PC 
mm 


~~I 


nn 


Data 
Memory 


Program 
(Relative to the 


Memory 
C5 Register) 


REP 


Number of cycles: 2 incurred only for recognition of 
the repeat prefix, and not included 
with each iteration of the following 
string primitive 


1. 
The encoding for REPE and REPZ is F316• The encoding for REPNE and REPNZ is 
F216• 


2, 
REP is referred to as an instruction 
prefix. Other prefixes include LOCK and SEG, 


If REP is combined with a LOCK or SEG prefix, certain precautions must be taken, 
Consult the next chapter for a discussion of these precautions. 


Pop the top two stack bytes into the program counter; these two bytes provide the 
offset address of the next instruction 
to be executed. 
Pop the next two stack bytes into 
the CS register; these two bytes provide the code segment address of the next instruc- 
tion to be executed. Previous program counter and Code Segment register contents are 
lost. 


The encoding for this instruction 
is: 


RET 


CB 


0 
0 
T 
S 
Z 
A 
P 
C 


PSW 
~~I 


SP 
ss 
ss 


BP 


Sl 


01 


PC 
mm 
mm 


CS 
nn 
nn 


OS 


SS 
tt 
tt 


ES 


41 


RET 
Number of cycles: 
18 


Notes: 


Data 
(Relative to 


Memory 
the SS Register! 


yy 
uuuus 


xx 
uuuus 
+ 1 


ww 
uuuus + 2 


vv 
uuuus + 3 


uuuus + 4 


Program 
(Relative 
to the 


Memory 
CS Reglsterl 


CB 
ppppm 


ppppm + 1 


ppppm + 2 


ppppm + 3 


1. 
Every subroutine 
should have at least one RET instruction. 
This instruction 
is the 
last instruction 
executed 
in the subroutine 
and returns control to the calling pro- 
gram. 


2. 
This RET instruction 
corresponds 
to the two intersegment 
CALLs, intersegment 
direct and intersegment 
indirect. 


3. 
No statuses are affected. 


Move the contents of the two top stack bytes to the program counter; i.e., pop the 
stack into the program counter. 
These bytes provide the offset address of the next 
instruction 
to be executed. 
Previous program counter contents 
are lost. 


The encoding for this instruction 
is: 


RET 


C3 


Data 
(Relative to the 
0 
0 
T 
5 
Z 
A 
P 
C 
Memory 
55 Register! 


P5W 
yy 
uuuus 


xx 
uuuus + 1 
~~I 


uuuus + 2 


Program 
(Relative 
to the 


Memory 
C5 Register) 


5P 
ss 
ss 
C3 
ppppm 


BP 
ppppm + 1 


51 
ppppm + 2 


01 
ppppm + 3 


PC 
mm 
mm 


C5 
nn 
nn 


05 


55 
tt 
tt 


E5 


RET 
Number of cycles: 8 


Notes: 


1. 
This instruction 
performs the same function as the 8080 instruction 
RET. 


2. 
Every subroutine 
should 
contain 
at least one RET instruction; 
this is the last 
instruction 
executed 
within the subroutine 
and causes execution 
to return 
to the 
calling program. 
Other 
methods 
may be used to return 
to a calling program; 


however, typically they are less efficient and more obscure than the straightforward 
RET instruction. 


3. 
The 8086 offers three other kinds of RETs. These RETs have some correspondence 
to the CALL 
instructions. 
This RET corresponds 
to CALL 
disp 
and -CALL 
mem/reg 
indirect intrasegment. 


4. 
No statuses are affected. 


Pop the two top stack bytes into the program counter; these two bytes provide the 
offset address for the next instruction 
to be executed. 
Pop the next two stack bytes into 
the CS register; these two bytes provide the code segment address of the next instruc- 
tion to be executed. Previous program counter and Code Segment register contents are 
lost. Add the data in the two succeeding 
program memory bytes to the stack pointer. 


This has the effect of adjusting the stack pointer past parameters 
that might have been 
placed onto the stack prior to the CALL that corresponds 
to this RET. 


The encoding for this instruction 
is: 


RET-.- 
CA 
disp16 
L 


k~"""-"<.,, 
8 ,,,' ,,, •••"-M """,Md 
displacement that is added to the Stack 
Pointer 


------Low-order 
8 bits of the 16-bit unsigned 
displacement that is added to the Stack 
Pointer 


RET 
jjkk 
Number of cycles: 
17 


Data 
IRelative to the 


Memory 
SS Register! 


yy 
uuuus 


xx 
uuuus + 1 


ww 
uuuus + 2 


vv 
uuuus + 3 


uuuus + 4 


Program 
IRelative 
to the 


Memory 
CS Register) 


CA 
ppppm 


kk 
ppppm + 1 


jj 
ppppm + 2 


ppppm + 3 


1. 
No statuses are affected. 


2. 
Every subroutine 
should have at least one RET instruction. 
This instruction 
is the 


last instruction executed in the subroutine 
and resumes execution in the calling pro- 


gram at the instruction 
after the corresponding 
CALL. 


3. 
This RET instruction 
corresponds 
to the two intersegment 
CALLs, intersegment 
direct and intersegment 
indirect. 


Pop the stack into the program counter; 
the two bytes moved provide the offset 
address of the next instruction 
to be executed. 
Previous program counter contents 
are 
lost. Add the data in the two succeeding program memory bytes to the stack pointer. 
This has the effect of adjusting the stack pointer past parameters 
that might have been 
placed onto the stack prior to the CALL that corresponds 
to this RET. 
The encoding for this instruction 
is: 


RET 
disp16 
-.- 
C2L 


k 


k 


_ii 
I 
L High-order 8 bits of the 16-bit unsigned 
displacement that is added to the Stack Pointer 


Low-order 8 bits of the 16-bit unsigned 
displacement that is added to the Stack Pointer 


Data 
(Relative to the 


0 
D 
T 
S 
Z 
A 
P 
C 
Memory 
SS Register! 


PSW 
yy 
uuuus 


xx 
uuuus 
+ 1 
~~I 


uuuus + 2 


Program 
(Relative 
to the 


Memory 
CS Reglsterl 


SP 
ss 
ss 
C2 
ppppm 


BP 
kk 
ppppm + 1 


Sl 
ppppm + 2 


DI 
ppppm + 3 


PC 
mm 
mm 


CS 
nn 
nn 


DS 


SS 
tt 
tt 


ES 


RET 
jjkk 
Number of cycles: 
12 


Notes: 


1. 
Every subroutine 
should 
contain 
at least one RET instruction; 
this is the last 


instruction 
executed 
within the subroutine 
and causes execution 
to return to the 
calling program. 


2. 
The 8086 offers three other kinds of RETURN instructions. 
These RETURNs have 
some correspondence 
to the CALL instructions. 
This RET corresponds 
to CALL 


disp and CALL mem/reg 
indirect intrasegment. 


3. 
No statuses are affected. 


Rotate 
the contents 
of the specified register 
or memory 
location 
left by the 
specified number 
of bits. The number 
of bits to rotate, 
represented 
by the variable 


count, is either one or the number contained 
in the CL register. 


The encoding for this instruction 
is: 


Addressing mode byte (s) as described 
earlier in this chapter 


w = 0 
S-bit operation 
w = 1 
16-bit operation 


c = 0 
Rotate left one bit 
c = 1 
Rotate left the number of bits 
specified by the CL register 


Suppose that the BX register contains ABIF16 and the CL register contains 0316• After 
the instruction 


Data 
0 
D 
T 
5 
Z 
A 
P 
C 
Memory 


P5W 


AX 


BX 
xx 
yy 


CX 
zz 


DX 


Program 
(Relative 
to the 


Memory 
C5 Register) 


5P 
D3 
ppppm 


BP 
C3 
ppppm + 1 


51 
ppppm + 2 


DI 
ppppm + 3 


PC 
mm 
mm 


C5 
nn 
nn 
Program Memory 


D5 
Address Calculation 


55 


E5 


ROL 
BX.CL 
Number of cycles: Register (N-bit rotate): 
B + (4· 
NI 
Register tl-bit 
rotate): 
2 
Memory (N-bit rotate): 
20 + EA + (4 • N) 


Memory tl-bit 
rotate): 
15 + EA 


Notes: 


1. 
This instruction 
performs 
the same function 
as the 8080 instruction 
RLC. This 
instruction does, however, allow a great deal more flexibility in that multi-bit rotates 
are allowed, rotations of 16-bit quantities are allowed, and any register or memory 
location may be rotated. 


2. 
Whether an 8- or 16-bit quantity is to be rotated is not immediately obvious when 
considering the syntax of this instruction. 
The assembler used will have a great deal 
to do with how this difficulty is solved. 


Rotate 
the contents 
of the specified register or memory 
location right by the 
pecified number 
of bits. The number 
of bits to rotate, 
represented 
by the variable 


'ount, 
is either one or the number contained 
in the CL register. 
The encoding for this instruction 
is: 


Addressing mode byte(s) as described 
earlier in this chapter 


w = 0 
B-bit operand 
w = 1 
16-bit operand 


c = 0 
Rotate right one bit 
c = 1 
Rotate right the number of bits 


I 
specified by the CL register 


Suppose that the DS register contains 
FOOOI6, the SI register contains 06B216 
and the 
~yte at memory location F06B216 contains 0416, After the instruction 


ROR 
[51],1 


has executed, 
the byte at memory location F06B216 
will contain 0216 and the Carry and 
Overflow statuses will be set to O. 


(Relative to the 
DS Register) 


PSW 
~~I 


SP 


BP 


SI 
uu 
uu 


DI 


PC 
mm 
mm 


CS 
nn 
nn 


DS 
It 
tt 


SS 


ES 


Program 
(Relative to the 


Memory 
CS Register! 


DO 
ppppm 


DC 
ppppm 
+ 1 


ROR 
ISil.1 


Number of cycles: Memory (l-bit 
rotate): 
15 + EA 
Memory (N-bit rotate): 
20 + EA + (4 • N) 
Register (l-bit 
rotate): 
2 
Register (N-bit rotate): 
B + (4 • N) 


Notes: 


1. 
This instruction 
performs 
the same function 
as the 8080 instruction 
RRC. This 
instruction does, however, allow a great deal more flexibility in that multi-bit rotates 
are allowed, rotations of 16-bit quantities are allowed, and any register or memory 
location may be rotated. 


2. 
Note that whether 
an 8- or 16-bit quantity is to be rotated cannot be determined 
from the instruction 


This instruction moves the contents of the AH register into the low-order 8 bits of 
the Flags register. The bits in the AH register are used as follows: 


Bit 7: 
Store into the Sign Flag 


Bit 6: 
Store into the Zero Flag 


Bit 5: 
Ignore 
Bit 4: 
Store into the Arithmetic 
Flag 


Bit 3: 
Ignore 
Bit 2: 
Store into the Parity Flag 


Bit 1: 
Ignore 
Bit 0: 
Store into the Carry Flag 


The encoding for this instruction 
is: 


SAHF 
-.- 
9E 


For example, suppose that the AH register contains E716• Executing an 


SAHF 


instruction 
would set the Sign, Zero, Parity, and Carry statuses 
to 1 while setting the 
Arithmetic 
flag to O. 


Data 


Memory 


I 
Program 
(Relative to the 


Memory 
CS Register) 


SAHF 
Number of cycles: 4 


Notes: 


1. 
No registers, 
other than the Flags register, are affected. The Overflow, Direction, 


Interrupt, 
and Trap flags are not affected. 


2. 
This instruction 
is used along with POP AX to emulate 
the 8080 instruction 
POP PSW. 


8086 Code 
8080 Code 


POP 
AX 
POP 
PSW 
SAHF 


LAHF 
PUSH 
AX 


must have been used to save the 8080 flags. 


Shift the contents 
of the specified register 
or memory 
location 
right by the 
specified number of bits. The number of bits to shift, represented 
by the variable count, 
is either one or the number contained in the CL register. This is an arithmetic right shift. 


The encoding for this instruction 
is: 


SAR 
mem/reg,count 


10100cw 


Addressing mode bytelsl 
as described 
earlier in this chapter 


w = 0 
8-bit operation 
w = 1 
16-bit operation 


c = 0 
Shift right one bit 
c = 1 
Shift right the number of bits 
specified by the CL register 


Suppose that the CL register contains 
0516, the 01 register contains 
180A 16' the OS 
register contains 
F80016 and the word at memory 
location F980A16 
contains 
006416, 


After the instruction 


SAR 
[DIJ.CL 


has executed, the word at memory location F980A 16 will be 000316, 


0 
D 
T 
S 
Z 
A 
P 
C 


PSW 
~~I 


xx 


SP 


BP 


SI 


DI 
uu 
uu 


PC 
mm 
mm 


CS 
nn 
nn 


DS 
It 
tt 


SS 


ES 


Data 


Memory 
(Relative to the 
DS Register) 


Program 
(Relative 
to the 


Memory 
CS RegISter! 


Program Memory 
Address Calculation 


SAR 
[DIl.CL 
Number of cycles: 
Memory (N-bit shift): 
20 + EA + 14 • N) 
Memory (l-bit 
shift): 
15 + EA 
Register IN-bit shift): 
8 + (4 • N) 
Register (l-bit 
shift): 
2 


1. 
This is an arithmetic 
right shift as opposed to a logical right shift. The differences 
are: 


Shift all bits right once. Leave the high- 
order bit in the same state. This has the 
effect of sign extending 
the high-order 


bit. If a multi-bit 
shift is performed, 


sign extend the high-order 
bit as far as 


is necessary. 


Shift all bits right once. Shift a zero into 
the high-order 
bit. If a multi-bit shift is 


performed, 
continue 
shifting 
in zeros 


as necessary. 


Subtract the immediate data in the succeeding program memory byte(s) from the 


AL (8-bit operation) 
or AX C16-bit operation) 
register with borrow. The subtraction 
is 


performed 
using twos complement 
methodology. 


The encoding for this instruction 
is: 


SBB 


0001110w ep c=e,,,"_,,,,, "".M 
". 
'mm"'". 
L 
operand.This byte is only presentif w = 1 


-------,Low-order 
byte 
of 
the 
immediate 


operand.This byte is always present 


w 
= 
0 
a-bit 
operation. 
AL 
is 
subtrahendand destinationfor result 


Suppose that the AX register contains 
6B3AI6 and the Carry status is 1. After the 


instruction 


6B3A16=0110 
1011 
0011 
1010 


Two'sComp.of4D2C16 
= 1011 
0010 
1101 
0100 
Two's Compoof Carry = 1111 
1111 
1111 
1111 


0001 
1110 
0000 
1101 
I 


t 
t 
3 one bits, P is set to a 
AF is set to 1 
Overflow is set to a 


-----------Sign 
is set to a 


Carry out of the high-order bit is 
inverted, Carry is set to 1 


Non-zeroresult. set Z to 0 


SBB 
AX,jjkk 
Number of cycles: 4 


Notes: 


I, 
This instruction 
performs 
the same function 
as the 8080 instruction 
SBI data; 


however, 
this instruction 
also allows 16-bit operations. 


Subtract the immediate data in the succeeding program memory byte(s) from the 
specified register or memory location with borrow. An 8- or 16-bit operation 
may be 
specified. The subtraction 
is performed 
using twos complement 
methodology. 


The encoding for this instruction 
is: 


L 


k 
~ 
High-order 
byte 
of 
the 
Immediate 


operand. This byte ISonly present If 
s = a and w = 1. 


------Low-order 
byte 
of 
the 
immediate 


operand. This byte is always present. 


Addressing mode bytelsl 
as described 


earlier in this chapter. 


w = a 
8-bit operation 
w = 1 
16-bit operation 


s is the sign extension bit. If w = O. this 
bit is ignored. 
If w = 1. then s = 0; all 16 bits of the 
immediate operand are present. 


s = 1. only the low-order 8 bits of the 
immediate 
operand 
are present. The 


high-order 8 bits of the 16-bit operand 
are formed by sign extending the high- 
order bit of kk. 


For example, 
if the Carry Status is 0, the SS register contains 2FOOI6, the BP register 
contains OF6AI6, the contents of the DI register are 001816, and the contents of the word 
at memory location 2FF8216 are 040016, then executing a 


SBB 
IBP + 51!. 03F8H 


till result in the word at memory location 2FF8216 being altered to 000816, 


04001 6 = 0000 
01 00 
0000 
0000 
Two's Compoof 03F816 
= 1111 
1100 
0000 
1000 


Two's Compoof Carry = 0000 
0000 
0000 
0000 


0000 
0000 
0000 
1000 
I 


t 
t 
lone 
bit. P is set to a 


AF is set to a 


Overflow is set to a 
_____ 
Sign is set to a 


Carry out of high-order bit is 
inverted. Carry is set to a 


Non-zero result, set Z to 1 


Data 
(Relative to the 
0 
D 
T 
S 
Z 
A 
P 
C 
Memory 
SS Register) 


PSW 


yy 
vvvv 
~~I 


xx 
vvvv 
+ 


vvvv + 2 


Program 
(Relative 
to the 


Memory 
CS Register) 


SP 
81 
ppppm 


8P 
rr 
rr 
18 
ppppm + 


SI 
uu 
uu 
kk 
ppppm + 2 


DI 
ppppm 
:+- 3 


PC 
mm 
mm 


CS 
nn 
nn 
Program Memory 
Address Calculation 


DS 


SS 
It 
It 


ES 


Number of cycles: 
Memory operand: 
17 + EA 
Register operand: 
4 


SBB 
mem/reg" 
mem/reg2 


Subtract: 
Register from Register with Borrow 
Register from Memory with Borrow 
Memory from Register with Borrow 


Subtract the contents 
of the register or memory location specified by mem/reg2 


and the Carry status from the contents 
of the register or memory location specified by 
mem/reg1• 
An 8- or 16-bit operation 
may be specified. Either mem/regl 
or mem/reg2 


may be a memory operand, 
but one of the operands must be a register operand. 


The encoding for this instruction 
is: 


SBB 


000110dw 


Addressing mode byte(s) as described 
earlier in this chapter. 


w = 0 
a-bit operation 
w = 1 
16-bit operation 


d is the direction flag. If d = O. then the 
operand described by mod and rim is 
mem/reg 1 and the operand described 
by reg is mem/reg2. 
If d = 1. then the 
operand described by mod and rim is 
mem/reg2 
and the operand described 
by reg is mem/reg 1 


Consider the case where the DL register contains 0316, the BL register contains 6416, and 
the Carry status is 1. After the instruction 


SBB 
BL.DL 


6416 
= 0110 
0100 


Two·s Compo of 0316 
= 1111 
1101 


Two·s Compo of 1 = 1111 
1111 


~ 


O~'W~, 
0"'",.", 
," '0' 


Carry out of bit 3. AF is set to 0 
Overflow is set to 0 


Sign is set to 0 


Carry out of high-order 
bit. Carry is set to 0 


Non-zero result. set Z to 0 


SBB 
BL,DL 
Number of cycles: Register to register: 3 
Register to memory: 
16 + EA 
Memory to register: 9 + EA 


JCAS 


dompare Memory with AL or AX Register 


Compare the contents 
of the memory location addressed by the 01 register with 
tIe 
AL (8-bit operation) 
or AX 06-bit 
operation) 
register. The comparison 
is per- 
f rmed by subtracting the contents of the memory location addressed by the 01 register 
f om the AL or AX register and using the result to set the flags. Neither the memory 


I, cation nor the AX register is affected. The 01 register is incremented/decremented 
d pending on the value of the OF flag. 


The encoding for this instruction 
is: 


SCAS 


I 
1 01 
0 1 1Lw 
= 0 


An B-bit comparison. The 01 
register is incremented by 1 if 
OF = O. The OF register is 
decremented by 1 if OF = 1 


A 16-bit comparison. The 01 
register is incremented by 2 if 
OF = O. The OF register is 
decremented by 2 if OF = 1 


Son sider the case where the 01 register contains 000016, the ES register contains 180016, 
t~le OF flag is 0, the contents of memory location 1800016 are 0916, and the contents of 
tHe AL register are 0016, 
After the instruction 


I 
SCAS BYTE 


executes, the 01 register will contain 000116 and the flags will be set as follows: 


001 6 = 0000 
1101 


Two's Compoof 0916 
= 1111 
0111 


~ 


0000T010L 
lone 
bit, P is set to 0 


~ 
Carry out of bit 3, AF is set to 1 


Overflow is set to 0 


Sign is set to 0 


Carry out of high-order bit 
is complemented, Carry is set to 0 


Non-zero result, set Z to 0 


-'- 
0 
D 


PSW 


AX 


ex 


CX 


DX 


SP 


BP 


SI 


DI 
gg 


PC 
mm 


Data 


Memory 
(Relative to the 
ES Register) 


Program 
(Relative 
to the 


Memory 
CS Register) 


SCAS 
Number of cycles: 
15 for a single occurrence 


9 + (15 times repetition when preceded by REPprefix) 


1. 
The 
REP 
prefix 
and/or 
the 
LOCK 
prefix 
may be used 
with this 
instruction. 
If the 
REP 
prefix 
and 
the 
LOCK 
prefix 
are used 
with 
this 
instruction, 
certain 
problems 
may result. 
An analysis 
of this difficulty 
is presented 
in the next 
chapter. 


Use the specified segment register to compute the data memory address for the 
instruction this prefix precedes; i.e., use the contents of the specified segment register as 
the segment address for the data memory address calculation. 
The encoding for this instruction 
is: 


SEG 
segreg 


1001551101L 55 is two 
bits specifying 
the segment 
register. 


55 = 00 for ES 
01 for CS 
10 for SS 
11 for OS 


Consider the following situation: the DS register contains 
100016, the ES register con- 
tains 200016, 
the BX register contains 000816, 
the word at memory location 1000816 is 
FEFEI6, 
and the word at memory location 2000816 is 060A16. After the instructions 


SEG 
ES 
MOV 
AX.[BX) 


have executed, the AX register will contain 060A16. 


SEG ES 
Number of cycles: 2 


Dala 


Memory 


Program 
(Relallve to the 


Memory 
CS Register) 


SHL 
SAL 
mem/reg,count 
mem/reg,count 


Shift the contents of the specified register or memory location left by the specified 
number of bits. The number of bits to shift, represented 
by the variable count, is either 
one or the number 
contained 
in the CL register. This is a logical left shift. 
The encoding for this instruction 
is: 


SHL 
mem/reg.count 


1 1 0 1 0 0 c w 
mod 100 rim 


~ 


Addressing mode byte(s) as described 
earlier in this chapter 


w = 0 
8-bit 
operand 
w = 1 
16-bit 
operand 


c = 0 
Shift left one bit 
c = 1 
Shift left the number of bits 
specified by the CL register 


Suppose that the CL register contains 0216 and the Sf register contains A45016• After the 
nstruction 


Program 
(Relative to the 


Memory 
CS Register! 


SHL 
SI,CL 
Number of cycles: Register (N-bit shift): 8 + (4· 
N) 
Register (1-bit shift): 2 
Memory (N-bit shift): 20 + EA + (4· 
N) 
Memory (1-bit shift): 15 + EA 


Notes: 


1. 
This instruction 
can be used to perform multiplications 
in addition to shifts, Since 
the MUL and IMUL instructions 
require at least 71 cycles to execute, 
there are 


occasions where using shifts to perform multiplication 
becomes an attractive solu- 


tion, Typically, these situations arise when optimizing the speed of the code is more 
of a factor than memory conservation, 
and when the multiplication to be performed 
will always be a power of two, or will always be some constant. Consider the follow- 
ing cases: 


MOV 
SAL 
RET 


The MUL T$BY$8 routine requires 5 bytes of code for the routine and 3 bytes of 
code for the CALL. Instead 
of requiring 
71 cycles (minimum) 
to perform 
the 
multiply, however, 
19 cycles are necessary for the CALL and 32 cycles are necessary 
for the routine. 


SAL$THREE$TIMES 
SAL 
SAL 
SAL 
RET 


This routine requires an additional 2 bytes; however, this routine executes in a mere 
14 cycles. 


It is clear that selecting routines which only multiply by powers of two will certainly 
show off the SHL instruction. 
Consider the case of a multiply by 15. 


CALL 
MUL T$BY$l 
5 


MOV 
MOV 
SAL 
SUB 
RET 


CL,4 


DX.AX 
AL.CL 
AX.DX 


This routine requires 9 bytes of code and 41 cycles, an additional 19 for the CALL. 
This is only marginally faster than using the -MUL instruction. 
This routine 
can 
work much faster if individual SAL instructions 
are included. 


CALL 
MUL T$BY$l 
5 


MOV 
SAL 
SAL 
SAL 
SAL 
SUB 
RET 


In this case, the routine needs only 21 cycles to operate. 


2. 
8- or 16-bit rotation? 
This instruction, 
the way it is expressed 
in this description, 


doesn't specify this. 


Shift the contents 
of the specified register 
or memory 
location 
right by the 
specified number of bits. The number of bits to shift, represented 
by the variable count, 
is either one or the number contained in the CL register. The bit shifted into the high- 
order bit is a zero. This is a logical right shift. 
The encoding for this instruction 
is: 


SHR 
mem/reg.count 


110100cw 


Addressing mode byte(s) as described 
earlier in this chapter 


w = 0 
8-bit 
operand 
w = 1 
16-bit 
operand 


c = 0 
Shift right one bit 
c = 1 
Shift right the number of bits 
specified by the CL register 


Suppose that the BL register contains FOI6. After the instruction 


SHR 
8L 


Data 
Memory 


P5W 


AX 


BX 
shift right 


ex 


DX 


5P 


BP 


51 


01 


PC 
mm 
mm 


e5 
nn 
nn 


05 


55 


E5 


Program 
IRelative 
to the 


Memory 
e5 Register! 


Number of cycles: Register 11-bit shift): 
2 
Register IN-bit shiftl: 
8 + (4 • N) 
Memory (1-bit shiftl: 
15 + EA 
Memory (N-bit shift): 
20 + EA + (4· N) 


Notes: 


This is a logical right shift as opposed to an arithmetic right shift. The differences are 


Logical right (SHR) 
Shift all bits right once. Shift a zero into the high-order 
bit. If a multi-bit shift is performed, 
continue shifting in 
• 
zeros as necessary. 


Arithmetic 
right (SAR) 
Shift all bits right once. Leave the high-order 
bit in the 


same state. This has the effect of sign extending 
the 


high-order 
bit. If a multi-bit 
shift is performed, 
sign 


extend the high-order 
bit as far as is necessary. 


This instruction 
is used to set the Carry status to 1. No other statuses or register 
contents 
are affected. 


The encoding for this instruction 
is: 


STC 
-.- 
F9 


Data 


Memory 


::1===== 
DX -----_ ......•_----_ ......• 
Program 
(Relative to the 


Memory 
CS Register) 


STC 
Number of cycles: 2 


This instruction 
is used to set the Direction flag to 1. No other statuses or register 
contents are affected. This instruction 
makes string operations perform auto-decrement 


on the pointers used by the string operations. 
The encoding for this instruction 
is: 


STD 
-.- 
FD 


ODI 
TSZAPC 
PSWCQIIllID 
_ 


Data 
Memory 


AXI 
BX ~------~------ 
CX >--------0-------- 
DX --------~------ 
Program 
(Relative 
to the 


Memory 
CS Register) 


SP 


BP 


SI 


DI 


PC 
mm 
mm 


CS 
nn 
nn 


DS 


SS 


ES 


STD 
Number of cycles: 
2 


Set the Interrupt 
flag to 1 after the execution of the next instruction. 
This has the 
effect of enabling interrupts. 
The reason 
for waiting one instruction 
is as follows. Most interrupt 
service 


routines 
end with the two instructions: 


STI 
; ENABLE INTERRUPTS 
RET 
; RETURN TO INTERRUPTED PROGRAM 


If interrupts 
are processed serially, then for the entire duration 
of the interrupt 
service routine 
all interrupts 
are disabled - 
which means 
that in a multi-interrupt 
application, 
there is a significant possibility for one or more interrupts 
to be pending 
when any interrupt 
service routine completes execution. 


If interrupts 
were acknowledged 
as soon as the STI instructions 
had executed, 
then the Return instruction 
would not be executed. Under these circumstances 
returns 
would stack up one on top the other and unnecessarily 
consume stack memory space. 
This may be illustrated as follows: 


By inhibiting interrupts 
for one more instruction 
following execution 
of STI, the 8086 
CPU ensures that the RET instruction 
gets executed in sequence: 


STI 
;ENABLE INTERRUPTS 
RET 
;RETURN FROM INTERRUPT 


It is not uncommon 
for interrupts 
to be kept disabled while an interrupt service routine 


is executing. 
Interrupts 
are processed serially: 


PSW 


;~I 


SP 


BP 


SI 


DI 


PC 
mm 
mm 


CS 
nn 
nn 


DS 


SS 


ES 


STI 
Number 
of cycles: 
2 


Data 


Memory 


Program 
(Relative 
to the 


Memory 
CS Register! 


Store the AL (8-bit operation) or AX C16-bit operation) register into the memory 


location addressed 
by the DI register. 
The DI register is incremented/decremented 
depending 
on the value of the DF flag. 
The encoding for this instruction 
is: 


STOS 
11010101L 


w=O 
8 bits are transferred. 
The 01 
register is incremented by 1if 
OF = O. The 
01 register 
is 
decremented 
by 1 if OF = 1 


16 bits are transferred. The 01 
register is incremented 
by 2 if 
OF = O. The 
01 register 
is 
decremented 
by 2 if OF = 1 


For example, 
suppose that the DF flag is 1, the DI register contains 
000A16, 
the ES 
register contains 280016, and the AX register contains 060416, After the instruction 


STOS 
WORO 


has executed, 
the contents of the word at memory location 2800A 16 will be 060416, 
and 
the DI register will contain 000816, 


Data 
(Relative to the 


Memory 
ES Registerl 


Program 
(Relative 
to the 


Memory 
CS Registerl 


Program Memory 
Address Calculation 


Notes: 


1. 
No statuses are affected. 


STOS WORD 
Number of cycles: 11 per single Occurrence 
9 + (10 times repetition when preceded by the REP prefix) 


2. 
The segment address for this instruction 
is always contained in the ES register. No 
segment 
override 
prefix may be used for this instruction. 
If a segment 
override 
prefix is present, 
it will be ignored. 


3. 
This instruction 
may be preceded by the REP prefix and/or the LOCK prefix. Using 
the REP and the LOCK prefixes in conjunction 
with this instruction 
may cause 
problems. 
Consult 
the next chapter for a complete 
discussion 
of these potential 
difficulties. 


4. 
This instruction 
is very useful in setting entire buffers or data areas to a particular 
value. Consider the following instruction 
sequence: 


LES 
DI.JOB$COSTING$ARRAY 
MOV 
CX.JOB$COSTING$ARRAY$WORD$LENGTH 
MOV 
AX.OOOOH 


REP 
STOS WORD 


After this sequence has executed, the JOB$COSTING$ARRA 
Y will contain all D's. 


6. 
This instruction 
has an obvious 
problem. 
How does the assembler 
determine 
whether 8 or 16 bits are to be stored? For a discussion of this problem, consult the 
last section of this chapter. 


This instruction 
is used to subtract immediate data from the AL (8-bit operation) 
or the AX 06-bit 
operation) 
register. The subtraction 
is performed utilizing twos com- 
plement methodology. 
The encoding for this instruction 
is: 


SUB 


0010110w 


aC,dataL 


kc:=I=J 
L-High-order 
8 
bits 
of 
the 
Immediate 


operand. This byte ISonly present If w = 1 


-------- 
Low-order 
8 
bits 
of 
the 
immediate 


operand. This byte is always present. 


w 
= 
0 
8-bit 
operation. 
AL 
is 
the 


subtrahend and the destina- 
tion of the result. 


w 
16-bit 
operation. AX 
is the 


subtrahend and the destina- 
tion of the result. 


For example, 
suppose that the AL register contains 
6116, After the execution 
of the 
instruction 


6116 = 0110 
000 1 


Twoscomp.of6516 
= 1001 
1011 


1111 
1100 
~r t:= 50. , " ••. ," 
"" 
""" 
"" 
" 
, 
Non-zero result, set ZF to 0 


No Carry so set AF to 0 


Overflow is a 


Set the Sign flag to 1 


Set Carry to 1 


Notice that the resulting Carry is complemented. 
Note that FCI6 is the twos complement 
representation 
of -4, which is indeed the 
result we expect when we subtract 6516 from 6116, 


0 
D 


PSW 


AX 


BX 


CX 


DX 


SP 


BP 


SI 


DI 


PC 
mm 


CS 
nn 


DS 


SS 


ES 


Data 
Memory 


Program 
(Relative 
to the 


Memory 
CS Register! 


SUB 
AL,kk 
Number 
of cycles: 
4 


Notes: 


1. 
This instruction 
performs the same function as the 8080 instruction 
SUI data. This 


instruction, 
however, 
also allows 16-bit operations. 


SUB 
mem/reg,data 


Subtract Immediate Data from Register or Memory Location 


Subtract the immediate data in the succeeding program memory byte(s) from the 
specified register or memory location. An 8- or 16-bit operation 
may be specified. 
The encoding for this instruction 
is: 


L 


k 


k 


__I 
iJ_t 
__ 
~" 
." 
•. 
""" 
"'" 
,',", 
'mm"'." 
operand. This byte is only present if w = 1 


The low-order 
byte of the immediate 


operand. This byte is always present. 


Addressing mode byte (s) as described 
earlier in this chapter. 


w = 0 
a-bit operation 
w = 1 16-bit operation 


s is the sign extension bit. If w = O. this 
bit is ignored. 
If w = 1. then 
s = 0 all 16 bits of 
the 
immediate 


operand are present. 


S = 1 only the low-order a bits of the 


immediate operand are present. 
The high-order a bits of the 16- 
bit operand are formed by sign 
extending the high-order 
bit of 
kk. 


Suppose that the DS register contains 300016, 
the SI register contains 004016, 
and the 
word at memory location 3005416 contains 433616, After the instruction 


sua 
[51 + 14Hl. 0136H 


has executed, the word at memory location 3005416 will contain 420016, The flags will be 
set as follows: 


433616 
= 0100 
0011 
0011 
0110 


Two's Compoof 013616 
= 1111 
1110 
1100 
1010 


0100 
0010 
0000 
0000 


11 
It, 
""M "''''",", 
low-order eight bits, set P to 1 


AF set to 0 
~----------Overflow 
to 0 
------------Sign 
to 0 


Complement Carry out, set C to 0 


Non-zero result, set Z to 0 


Program 
(Relative to the 


Memory 
CS Register) 


81 
ppppm 


6C 
ppppm + 


gg 
qq 
ppppm + 2 


kk 
ppppm + 3 


mm 
jj 


0 
D 


PSW 
~~I 


SP 


BP 


SI 
gg 


DI 


PC 
mm 


CS 
nn 


DS 
hh 


SS 


ES 


SUB [SI + qql, jjkk 
Number of cycles: immediate from memory: 
17 + EA 
immediate from register: 4 


Data 
Memory 
(Relative to the 
DS Register) 


Notes: 


1. 
This instruction 
is not usually applied to subtracting immediate data from the AX or 


AL register. The instruction 
SUB ac,data is provided for this purpose. 


SUB 
mem/re91, 
mem/re92 


Subtract: 
Register from Register 
Register from Memory 
Memory from Register 


Subtract the contents 
of the register or memory location specified by mem/reg2 


from the contents 
of the register or memory location specified by mem/regl. 
An 8- or 
16-bit operation 
may be specified. Either mem/regl 
or mem/reg2 
may be a memory 
operand, 
but one of the operands must be a register operand. 
The encoding for this instruction 
is: 


SUB 
mem/regl. 
mem/reg2 
I a a 1 a lad 
w I 
mod reg rim 
L:= 


Addressing mode bytelsl 
as described 


earlier in this chapter 


w = a 
B-bit operation 
w = 1 16-bit operation 


'---------- 
d is the Direction flag. If d = 0, then the 
operand described by mod and rim is 
mem/reg 1 and the operand described 
by reg is mem/reg2' If d = 1, then the 
operand by mod and rim is mem/reg2 
and the operand described by reg is 
mem/regl 


Suppose that the DH register contains 
4116, 
the SS register contains 
000016, 
the BP 
register contains 00E416, 
and the byte at memory location 000E816 contains SAI6• After 
the instruction 


411 6 = 01 00 
000 1 


Two's compo of 5A16 
= 1010 
0100 


1110 
0111 


~ 
1 Ls;,~..""'",0' 
No carry, set AF to 1 


Overflow is set to a 


Set S to 1 
No carry of high-order bit, set C to 1 


Non-zero result. set Z to a 


0 
D 


P5W 


AX 


BX 


CX 


DX 


5P 


BP 


51 


DI 


PC 


C5 


D5 


55 


E5 


Data 
Memory 


(Relative to the 
SS Registed 


xx 


Program 
(Relative 
to the 


Memory 
C5 Register) 


2A 
ppppm 


gg 
gg 
76 
ppppm + 


kk 
ppppm + 2 


ppppm + 3 


Number of cycles: 
Memory to Register: 9 + EA 
Register to Memory: 
16 + EA 
Register to Register: 3 


AND the immediate 
data in the succeeding 
program memory 
byte(s) 
with the 
contents of the AL (8-bit operation) 
or the AX 06-bit 
operation) 
register, but do not 
return the result to the register. 


The encoding for this instruction 
is: 


TEST 


1010100w 


ac.dataL 


kc=c 
High-order a 
bits of 
the Immediate 


operand.This byte ISonly presentIf w = 1 


--------Low-order 
byte 
of 
the 
immediate 


operand.This byte is always present. 


w = 0 
a-bit operation 
w = 1 
16-bit operation 


As an example, consider the situation where the AX register contains 73AC16• After the 
instruction 


has executed, 
the AX register will still contain 73AC16, 
but the Flags register will have 
been altered to reflect the ANDing of 73ACI6 and 004016, 


73AC16=0111 
0011 
1010 
1100 


00401 6 
= 0000 
0000 
0100 
0000 
0000 
0000 
0000 
0000 


1 
LZero one bits. set P to 1 
AF is undetermined 
Overflow is clearedto 0 
-----------sign 
is set to 0 
Carry is clearedto 0 
Zero result. set Z to 1 


TEST 
AX,jjkk 
Number of cycles: 4 


Data 
Memory 


Program 
(Relative 
to the 


Memory 
CS Register! 


Notes: 


1. 
If it is desired to TEST the contents of other registers or memory locations, consult 
the TEST mem/reg,data 
instruction, 


AND the immediate 
data in the succeeding 
program memory 
byte(s) 
with the 
contents of the specified register or memory location, but do not return the result to the 
specified register or memory location. An 8- or 16-bit operation 
may be specified. 


The encoding for this instruction 
is: 


TEST 
mem/reg.data 


I mod 000 rim ILkee 
I 


High-order 
byte 
of 
the 
immediate 


operand.This byte is only present if w = 1 


------Low-order 
byte 
of 
the 
immediate 


operand. This byte is always present. 


~--------------Addressing 
mode byte(s) as described 


earlier in this chapter. 


w = a 
8-bit operation 


w = 1 16-bit operation 


For example, consider the case where the SI register contains 03F616• 
After the instruc- 
tion 


executes, the contents of the SI register will be unchanged; 
however, the flags will be set 
to reflect the result of ANDing 03F616 and 040016, 


03F616 = 0000 
0400 16 = 0000 


0000 


1 
1 


--------------- 
Sign is set to 0 
Carry is cleared to a 


Zero result. set Z to 1 


0011 
1111 
0110 
01 00 
0000 
0000 


0000 
0000 
0000 


Zero one bits. set P to 1 


AF is undefined 
Overflow is cleared to a 


::1-...---- 
CX ------- '-------- 
DX ------- -------- 


Data 


Memory 


Program 
(Relative to the 


Memory 
CS Register) 


F7 
ppppm 


C6 
ppppm + 


kk 
ppppm + 2 


jj 
ppppm + 3 


TEST Sl,jjkk 


Number of cycles: Immediate with register: 5 
Immediate with memory: 
11 + EA 


1. 
Testing 
the AX or AL register 
would 
not be a function 
normally 
associated 
with this 


instruction, 
since 
the TEST 
aC,data 
is provided 
for this 
purpose. 


AND the contents 
of the specified register with the contents 
of the specified 
register or memory location using the result to set the flags, but not returning the result 
to the register(s) 
or memory location. An 8- or 16-bit operation 
may be specified. 
The encoding for this instruction 
is: 


TEST 
reg.mem/reg 


\1 0 0 0 0 lOw 
I 
mod reg rim 


t 
L Addressing mode bytelsl 
as described 


~ 
earlier in this section. 


w = 0 
8-bit operation 
w = 1 
16-bit operation 


Suppose that the AL register contains 4016, the OS register contains 880016, and the byte 
at memory location 8805316 is AF 16' After the instruction 


TEST AL.!+ 
531 


has executed, 
neither the AL register nor the byte at memory location 8805316 will be 
affected; however, 
the flags will be affected as follows: 


4016 = 0100 
0000 
AF16 = 1010 
1111 


0000 
0000L 


zero one bits. set P to 1 


AF is undefined 
This instruction clears Overflow 


Sign is set to 0 


This instruction clears Carry 
Zero result. set Z to 1 


PSW 


AX 


BX 


CX 


DX 


SP 


BP 


SI 


DI 


PC 
mm 


CS 
nn 


DS 
hh 


SS 


ES 


Number of cycles: 
Register with Memory: 
9 + EA 
Register with Register: 
3 


Data 
Memory 
(Relative to the 
DS Register) 


Program 
(Relative 
to the 


Memory 
CS Register) 


06 
ppppm 
+ 1 


kk 
ppppm 
+ 2 


This instruction causes the 8086 to enter an idle state if the signal on the TEST pin 
is not asserted. The 8086 will only leave the idle state if one of two conditions 
is met. 


The conditions are: 


1. 
If the interrupts 
are enabled, an external interrupt will force the 8086 to ser- 
vice the interrupt. The address saved when the 8086 processes the interrupt is 
the address of the WAIT instruction. Thus, when the interrupt service routine 
returns, 
it returns to the WAIT instruction. 


2. 
The TEST signal is asserted. 


The encoding for this instruction 
is: 


WAIT-.- 
98 


Data 


Memory 
::1-- 
cx ~------------- 
DX ~------------- 
Program 
(Relative to the 


Memory 
CS Register) 


WAIT 
Number of Cycles: Minimum = 3 


Exchange the 16-bit contents 
of the specified register with the contents 
of the 
accumulator. 


The encoding for this instruction 
is: 


XCHG 
reg 


10010"r 
IL3 bits which specify the 16-bit register 
to be exchanged with the AX register. 


rrr = 000 for AX 
001 for CX 
010 for DX 
011 for BX 
100 for 5P 
101 for BP 
110 for 51 
111 for DI 


F 


XCHG 
BX 
Number of cycles: 3 


Data 


Memory 


Program 
(Relative to the 


Memory 
CS Register! 


1. 
No statuses are affected. 


2. 
The instruction 
XCHG AX, AX is the instruction which is used as the NOP instruc- 
tion by the 8086. 


This instruction swaps the contents of the register or memory location specified by 
the mem/reg 
operand with the contents of the register specified by the reg operand. An 
8- or 16-bit transfer may be specified. 
The encoding for this instruction 
is: 


XCHG 


1000011w 


reg,memlreg 


mod reg rIm 
I 
L Addressing 
mode byte(s) as described 
earlier in this chapter. 


w = 0 
a-bit exchange 
w = 1 16-bit exchange 


Evaluate the situation where the contents of the BX register are 6F3016, 
the SS register 
contains 2FOOI6, the SI register contains 004616, the BP register contains 020016, and the 
word stored at memory location 2F24616 is 415416, After the instruction 


XCHG 
aX,[ap 
+ Sil 


executes, the BX register will contain 415416, memory location 2F24616 will contain 3016 
and memory location 2F24 716 will contain 6F 16' 


3-276 
The 8086 Book 


Data 
(Relative to the 
0 
D 
T 
5 
Z 
A 
P 
C 
Memory 
55 Register! 


P5W 
vv 
ggggg 


ww 
ggggg + 1 


AX 
ggggg + 2 


BX 
xx 
yy 


CX 


DX 


Program 
(Relative 
to the 


Memory 
C5 Register) 


5P 
47 
ppppm 


BP 
hh 
hh 
1A 
ppppm + 


51 
ww 
ww 
ppppm + 2 


DI 
ppppm + 3 


PC 
mm 
mm 


C5 
nn 
nn 


D5 
Program Memory 


Address Calculation 
55 
tt 
tt 


E5 


XCHG 
BX, [BP + 51) 


Number of cycles: 
memory with register: 
17 + EA 
register with register: 
4 


2, 
The segment 
registers 
may not be specified 
in this instruction, 
There 
is no instruc- 


tion to exchange 
segment 
registers, 


3. 
Typically, 
this 
instruction 
is not used 
to exchange 
a register 
with the AX register. 


The 
instruction 
XCHG 
reg is provided 
for this 
purpose. 


An 8-bit data element 
is loaded 
into the 
AL register. 
This data element 
is 
addressed 
by using the following algorithm: 


1. 
Add the 8-bit contents 
of the AL register to the 16-bit BX register. 


2. 
Use the result of the addition 
in step 
1 as the offset address for the DS 
register 
(assuming that no segment override has been executed). 


Instruction 
encoding for XLA Tis: 


XLAT 
-,.- 
D7 


For example, should the AL register contain OFI6, the BX register 004016, 
and the DS 
register FOOOI6, then executing the instruction 


XLAT 


P5W 
yy 
kkkkk 


kkkkk + 1 


AX 
xx 


BX 
gg 
gg 


CX 


DX 
Data Memory 
Address 
Calculation 
Program 
(Relative 
to the 


Memory 
C5 Register) 


5P 
D7 
ppppm 


BP 
ppppm 
+ 


51 
ppppm 
+ 2 


DI 
ppppm 
+ 3 


PC 
mm 
mm 


C5 
nn 
nn 


D5 
hh 
hh 
Program 
Memory 


Address 
Calculation 
55 


E5 


XLAT 
Number 
of cycles: 
11 


1. 
This instruction 
is most commonly used in the case where the BX register contains 


the beginning address of a table and the AL register is used as an index into the 
table. 


This instruction 
exclusive-ORs 
8- or 16-bit data elements 
with the AL (8-bit) or 


AX (l6-bit) 
register via immediate 
addressing. 
The encoding for this instruction 
is: 


XOR 
ac.dataL'T.""0'.'-""0""," 0' 
the 16-blt Immediate operand. 


---------kk 
is the low-order 8 bits of 


the 16-bit immediate operand. 
This byte is always present. 


w = a 
8-bit operation. AL is one of 


the operands and is the destination 
for the result. 


w = 1 
16-bit operation. AX is one of the 


operands and is the destination for 
the result. 


3-280 
The 8086 Book 


Data 
0 
0 
T 
S 
Z 
A 
P 
C 
Memory 


PSW 


AX 
xx 
yy 


BX 


CX 


OX 


Program 
(Relative 
to the 


Memory 
CS Register! 


SP 
35 
ppppm 


BP 
kk 
ppppm + 1 


SI 
ppppm + 2 


01 
ppppm + 3 


PC 
mm 
mm 


CS 
nn 
nn 


OS 
Program Memory 


SS 
Address Calculation 


ES 


XOR 
AX,jjkk 
Number of cycles: 
4 


1. 
This instruction 
performs 
the same 
function 
as the XRI 
data instruction 
in the 8080 
assem bly language, 
This instruction, 
however, 
also allows 
16-bit data units, 
whereas 
the 
8080 
XRI 
only 
uses 
8-bit data 
elements. 


mem/re9', 
mem/re92 


Register with Register 
Register with Memory 
Memory with Register 


Exclusive-OR 
the contents of the register or memory location specified by mem/ 


reg2 with the contents of the register or memory location specified by mem/reg" 
return- 


ing the result to mem/reh 
An 8- or 16-bit operation may be specified. Either mem/reg, 


or mem/reg2 
may be a memory operand, 
but one of the operands 
must be a register 


operand. 
The encoding for this instruction 
is: 


mem/reg 1, mem/regZ 


~_m 
_ 
- 
Addressing mode byte(s) 
as described earlier in 
this chapter. 


w = 0 
B-bit operation 
w = 1 
16-bit operation 


d is the Direction flag. 
If d = 0, then the operand 
described by mod and 
rim is mem/reg 1 and the 
operand described by 
reg is mem/regZ' 
If 
d = 1, then the operand 
described by mod and 
rim is mem/regZ 
and the 
operand described by 
reg is mem/reg 1. 


Suppose that the AX register contains 0787'6' the OS register contains 9080'6' the SI 
register contains 040E16, and the word at memory location 90COEI6is A6FO'6' After the 
instruction 


078716 
= 0000 
0111 
10110111 
A6F016 = 1010011011110000 


10100001 
010Qjlll1 


1 
t 
Four one bits, set P to 1 


AF is undefined 
Carry is cleared to 0 


'-----------Set 
the Sign status to 1 
Overflow is cleared to 0 
Non-zero result, set Z to 0 


XOR 
the 
immediate 
data 
in the 
succeeding 
program 
memory 
byte(s) 
with 
the 
specified 
register 
or memory 
location. 
An 8- or 16-bit 
operation 
may be specified. 


The 
encoding 
for this instruction 
is: 


XOR 
mem/reg.data 


110 0 0 00 owll_m_o_d -I-o-r/m_1 
jJ 
I 
L The high-order byte of the immediate 
operand.This byte is only presentif w = 1 


---------The 
low-order byte of the immediate 
operand.This byte is always present. 


Addressing mode byte(s) as described 
earlier in this chapter 


w = 0 
8-bit operation 


w = 1 
16-bit operation 


Consider 
the case where 
the DS register 
contains 
380016, 
the contents 
of the BX register 
are 020016, 
the DI register 
contains 
013616, 
and the word at memory 
location 
3833616 
is 
06B316• 
After the instruction 


XOR[BX + DII. 0805H 


has executed, 
the word at memory 
location 
3833616 
will be OEB616. 


06B316=0000 
0110 
1011 
0011 


050516 
= 0000 
1000 
0000 
0101 
0000 
1110 
101 1 
01 10L 
8 one bits. set P to 1 


Carry is cleared to 0 


------------Slgn 
IS set to 0 
Overflow IS cleared to 0 


Non-zero result. set Z to 0 
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Data 
(Relative 
to the 


0 
0 
T 
S 
Z 
A 
P 
C 
Memory 
OS 
Registerl 


PSW 


yy 
uuuuu 


AX 
xx 
uuuuu 
+ 1 


BX 
gg 
gg 


CX 


OX 


Program 
(RelatIve 
to the 


Memory 
CS Reglsterl 


SP 
B1 
ppppm 


BP 
31 
ppppm 
+ 
1 


SI 
kk 
ppppm 
+ 2 


01 
hh 
hh 
ppppm 
+ 3 


PC 
mm 
mm 


CS 
nn 
nn 


OS 
rr 
rr 


SS 


ES 


Number 
of cycles 
Memory 
operand: 
17 + EA 


Register 
operand: 
4 


1. 
This instruction 
is not typically used to XOR immediate 
data with AX or AL 


register. The instruction 
XOR ac,data is provided for this purpose. 


Some 
8086 assembler 
mnemonics 
do not expressly 
define 
whether 
an 8-bit or 16- 
bit operation 
is to be performed. 
All instructions 
which have one of the 8086 registers 
as 
an operand 
can 
use the 
register 
to determine 
whether 
an 8-bit 
or 16-bit 
operation 
is 
required. 
For example, 
if the instruction 
is 


XOR 
AX.OB04H 


a 16-bit operation 
is clearly 
in order. 
Some 
instructions, 
however, 
can perform 
an 8-bit 


or 16-bit operation, 
yet may not specify 
a register. 
These 
instructions 
include 
two basic 


types: 
the 
string 
operations, 
e.g., 
CMPS, 
LODS, 
and 
instructions 
which 
can 
have 
a 
single 
memory 
operand, 
e.g., 
MUL, 
NOT. 
Assem bIers can deal with this difficulty 
in one 
of three 
ways. These 
are: 


1. 
A character 
can be added 
to the mnemonic 
that 
will indicate 
whether 
a word 


or byte operation 
is specified. 
For example, 
M UL can be specified 
as 


MULB 
B x B-bit multiplication. 
or 
MULW 
16 x 
16-bit multiplication 


In the case of the string 
operations, 
the character, 
be it B (for Byte) or W (for 


Word) 
replaces 
the last character 
of the mnemonic. 
For example, 
the instruc- 
tion 
CMPS 
is replaced 
by: 


CMPB 
B-bit comparison 
CMPW 
16-bit comparison 


2. 
The operand 
for the string 
operations 
can be WORD 
or BYTE. 
For example, 


the MaYS 
instruction 
can be: 


In addition, 
this strategy 
can be applied 
to the single memory 
operand 
instruc- 
tions. 
For example, 


NOT 
51 
.BYTE 


NOT 
51 
.WORD 


3. 
The 
programmer 
defines 
all his data 
areas 
and symbols 
as WORD 
or BYTE 
elements. 
The assembler 
retains 
this information 
and then 
when 
a reference 


is made 
to the data area/symbol, 
the assembler 
determines 
whether 
an 8- or 
16-bit operation 
is necessary. 
For example, 


TOUCH$TONE$OUTPUT$BYTE 
TIMER 


an 8-bit operation 
is assembled 
for the NOT operation 
and a 16-bit operation 
is specified 
for the INC operation. 
Note 
that 
a particular 
assembler 
may use 


more 
than 
one 
of these 
options 
to perform 
its function. 


8086 
Instruction Groups 


This chapter contains another discussion of the 8086 instruction set. In contrast to 
the 
discussion 
presented 
in 
Chapter 
3, 
where 
each 
instruction 
was 
described 
individually, 
this chapter discusses groups of instructions. 
The 8086 instructions 
are 
grouped according to the functions they perform. These groups are: 


Data Movement 
Instructions 
Arithmetic 
Instructions 


Logical Instructions 
String Primitive Instructions 
Program Counter 
Control Instructions 


I/O Instructions 
Interrupt 
Instructions 


Rotate and Shift Instructions 


The 8086 instructions 
which perform data movement 
are shown in Table 4-1. 


8086 data movement 
instructions 
can be divided into three general categories: 


Instructions 
which move data from register to register or between memory loca- 
tions and registers. 


1. 
Instructions 
which move data from register to register or between memory 
locations and registers. 


2. 
Instructions 
which move data onto and off of the stack. 


3. 
Instructions 
which 
move 
multiple 
bytes 
from 
one 
memory 
location 
to 
another. 


The first and second types of instructions 
will be discussed in this section. Multi- 
ple byte instructions, 
which are created using instructions 
called string primitives, 
will 
only be discussed peripherally in this section; they will be discussed in detail later in this 
chapter. 


Data movement 
instructions 
are used in following types of routine: 


1. 
A routine to move the contents 
of BUFFER$A 
to BUFFER$B. 


2. 
A routine to initialize the contents 
of BUFFER$A. 


3. 
A routine to translate the contents of BUFFER$A. 


Status 
Mnemonic 
Operands 
Object 
Code 
Bytes 
Clocks 
Operation 
Performed 
0 
0 
I 
T 
S 
Z 
A 
P 
C 


MOV 
mem/reg" 
100010dw 
2,3 or 
reg - reg: 2 
[mem/reg,] 
- 
[mem/reg2J' 
mem/reg2 
mod rrr rim 
4 
mem- reg: 
An 8- or 16-bit data element is moved from the memory 
IDISP) 
8 + EA 
location 
or 
register 
specified 
by 
mem/reg2 
into 
the 
IDISP) 
reg -mem: 
memory location or register specified by mem/reg,. 


9 + EA 


MOV 
mem/reg, 
1100011w 
3,4,5 
10 + EA 
[mem/reg] 
- 
data 
data 
mod 000 rim 
or6 
Move 8- or 16-bit immediate data into the memory loca- 


(DISPI 
tion or register specified by mem/reg. 


(DISPI 


kk 


jj (if w = 1) 


MOV 
reg,data 
1011wrrr 
2 or 3 
4 
[reg] - 
data 
kk 
Move 
8- 
or 
16-bit 
immediate 
data 
into 
the 
register 
lilifw= 
11 
specified by reg. 


MOV 
aC,mem 
1010000w 
3 
10 
[acl - 
[mem] 
kk 
Move data from the memory location specified by mem 


jj 
into the AL 18-bit operation) or the AX 116-bit operation I 
register. 


MOV 
mem,ac 
1010001w 
3 
10 
[meml - 
lac] 
kk 
Move data from 
the AL (8-bit operationl 
or AX (16-bit 
jj 
operationl 
register into the memory location specified by 
memo 


MOV 
segreg, 
8E 
2,3 or 
reg - reg: 2 
[segregJ - 
Imem/reg] 


mem/reg 
mod 0 ss rim 
4 
mem -reg: 
Move 16 bits of data from the memory location or register 
IDISP) 
8 + EA 
specified by mem/reg 
into the selected segment register. 
IDISP) 
If ss = 01, this operation is undefined. 


MOV 
mem/reg, 
8C 
2,3 or 
reg - reg: 2 
[mem/reg] 
- 
!segregl 
segreg 
modOss 
rim 
4 
mem - reg: 
Move the contents 
of the selected segment register into 
IDISP) 
9 + EA 
the specified memory location or register. 


IDISP) 


XCHG 
mem/reg" 
1000011w 
2,3 or 
reg - reg: 4 
[mem/reg,]-- 
[mem/reg2]' 
mem/reg2 
mod rrr rim 
4 
reg -mem: 
Exchange the 8- or 16-bit contents of the memory loca- 


IDISP) 
17 + EA 
tion 
specified 
by mem/reg 1 with 
the 
contents 
of the 


(DISP) 
memory location or register specified by mem/reg2. 


Status 
Mnemonic 
Operands 
Object 
Code 
Bytes 
Clocks 
Operation 
Performed 
0 
0 
I 
T 
S 
Z 
A 
P 
C 


XCHG 
reg 
10010rrr 
1 
3 
[AX) -- 
[reg] 
Exchange the contents 
of the AX register with the con- 
tents of the selected register. 


XLAT 
07 
1 
11 
[All 
- 
[[All 
+ [BX]] 
Load the data byte addressed by summing AL with 
BX 
into the AL register. 


LOS 
reg,mem 
C5 
2,3 or 
16 + EA 
[reg] - 
[mem], [OS) - 
[mem + 2) 
mod rrr rim 
4 
Load 16 bits of data from the memory location specified 
(DISP) 
by mem into the selected register. Load 16 bits of data 
(DISP) 
from the memory location following the memory location 
specified by mem into the OS register. 


LEA 
reg,mem 
80 
2,30r 
2 +EA 
[reg) - 
mem (offset portion of address) 
mod rrr rim 
4 
Move the 16 bits which 
are the offset 
portion 
of the 
(DISP) 
memory address into the selected register. 


(DISP) 


LES 
reg,mem 
C4 
2,3 or 
16 + EA 
[reg] - 
[mem], [ES] - 
[mem + 2) 
mod rrr rim 
4 
Load 16 bits of data from the memory location specified 
(DISP) 
by mem into the selected register. Load 16 bits of data 
(DISP) 
from the memory location following the memory location 
specified by mem into the ES register. 


PUSH 
memlreg 
FF 
2,3 or 
reg: 11 
[SP) - 
[SP] - 
2, [[SP]] - 
Imem/reg) 
mod 110 rim 
4 
mem: 
Decrement SP by 2. Store the 
16-bit 
contents 
of the 
(DISP) 
16 + EA 
memory location or register specified by memlreg 
onto 
(DISP) 
the top of the stack. 


PUSH 
reg 
01010rrr 
1 
10 
[SP) - 
[SP] - 
2, [[SP]] - 
[reg] 
Decrement SP by 2. Store the 
16-bit 
contents 
of the 
specified register onto the top of the stack. 


PUSH 
segreg 
000ssl10 
1 
10 
[SP) - 
[SP] - 
2, [[SP]] - 
[segreg] 
Decrement 
SP by 2. Store the 
16-bit 
contents 
of the 


specified register onto the top of the stack. 


PUSHF 
9C 
1 
10 
[SP) - 
[SP] - 
2, [[SP]] - 
[FLAGS) 


Decrement SP by 2. Store the contents 
of the FLAGS 
register onto the top of the stack. 


Status 
Mnemonic 
Operands 
Object 
Code 
Bytes 
Clocks 
Operation 
Performed 
0 
0 
I 
T 
S 
Z 
A 
P 
C 


POP 
mem/reg 
8F 
2,3 or 
reg: 8 
[mem/reg] 
- 
[[SP]L [SP] - 
[SP) + 2 
mod 000 r/m 
4 
mem: 
Move the 16 bits at the top of the stack into the memory 
(DISP) 
17 + EA 
location or register specified by mem/reg. Increment SP 
(DISPI 
by 2. 


POP 
reg 
01011", 
1 
8 
[reg) - 
[[SP)L [SP) - 
[SP) + 2 


Move the 16 bits at the top of the stack into the specified 
register. Increment SP by 2. 


POP 
segreg 
000ss111 
1 
8 
[segreg) -[[SPIL 
[SP) -[SP) 
+ 2 
Move the 16 bits at the top of the stack into the specified 
segment 
register. Increment 
SP by 2. If ss = 01, this 
operation is undefined. 


POPF 
9D 
1 
8 
X 
X 
X 
X 
X 
X 
X 
X 
X 
[FLAGS] -[[SP)L 
[SP] -[SP) 
+ 2 


Move the 16 bits at the top of the stack into the FLAGS 
register. Increment SP by 2. 


LAHF 
9F 
1 
4 
Transfer the 8080 
flags to the AH register. 
= 
- AH 


S 
Z 
A 
P 
C 


SAHF 
9E 
1 
4 
X 
X 
X 
X 
X 
Transfer the AH register to the 8080 
flags. 
= 


- AH 


S 
Z 
A 
P 
C 


Two elementary 
buffer-to-buffer 
move routines are shown in Figures 4-1 and 4-2 
for 8-bit and 16-bit data elements, 
respectively. 
These 
routines 
assume 
that the SI 
register contains 
the address of BUFFER$A, 
the DI register contains the address of 
BUFFER$B, 
and the ex register contains the number 
of data elements 
to move. 


MOV 
MOV 
INC 


INC 
DEC 


JNZ 
RET 


AL.ISIl 
[DIIAL 


SI 


DI 
CX 
MOVE$BYTES 
;DECREMENT '* TO MOVE 
;LOOP IF NOT DONE 


;LOAD BYTE FROM SOURCE 


;STORE BYTE INTO DESTINATION 
;ADJUST 
POINTERS 


MOV 
MOV 
AX.[SIl 
[DIJ.AX 
;LOAD WORD FROM SOURCE 
;STORE WORD INTO DESTINATION 


;ADJUST 
POINTERS 


SI 
SI 
DI 
DI 
CX 
MOVE$WORDS 


;DECREMENT '* TO MOVE 
;LOOP IF NOT DONE 


The instruction sequences illustrated in Figures 4-1 and 4-2 move data in the Data 
Segment. The BX register can be used in place of the SI or DI register in these routines. 
The routines 
as illustrated are simple to follow, but they are not very efficient. 


String primitive operations, 
which are described later in this chapter, may provide much 
more efficient buffer-to-buffer 
move routines; also, the LOOP instruction 
improves the 
efficiency of any decrement 
and branch program logic. 


There 
are 
many 
ways of initializing 
registers 
that 
are 
used 
by instruction 
sequences 
such 
as the 
buffer-to-butTer 
move 
routines. 
The initialization 
methods 
depend on how the addresses 
and the count are obtained. 
An additional factor is the 
number 
of registers that need to be initialized; 
for example, 
in most cases, the OS 
register will already be initialized. 
The butTer beginning addresses and the byte or word count for the buffer to buffer 
move routine can be held in a block of memory words pointed to by a register. Consider 
the following eight-byte memory block: 


=========== 
} 0"'" ,'d"" ,,,""'0' """" 


=========== 
} '"m,"' ,'d"" '0' os 


=========== } 0",'' "',," '0'""'"'''0" """" 
==}CO"", 


Memory byte pairs hold four addresses, 
as illustrated above. 
The block of memory 
words illustrated 
above is frequently 
referred 
to as a 
parameter 
block; the individual data values in the block are parameters. 
. 


The Dr register could be loaded with the starting address of this block (xxxx in the 
illustration 
above) to provide the parameters 
for the initialization sequence. 


LOS 
SUOI]] 
MOV 
eX.[OI + 6] 


________ 
M_O_V 
O_U_O_I 
_+_4_1 
_ 


The BX register can be used instead of the 01 register, if more convenient. 
When a string primitive instruction 
is used, the parameter 
block may have to be 
expanded 
as shown below to include a segment 
address which is loaded into the ES 
register: 


=========== 
} 0""' ""'.,, 'm ,,"., 
'"",' 


=========== } S"m,"' ."." 
'm os 


=========== 
} 0"''' ."." 'm ,,"'""'," 
'""" 


=========== 1S"m,"' ."." 
'm " 
===},'"", 


This technique allows data to be moved from any location to any other in the one 


million-byte memory space. If segments are not specified, data is moved within the cur- 
rent segments 
only. 


Again loading the 01 register with the beginning address for this block (xxxxx in 
the illustration 
above), 
the initialization sequence 
becomes. 


LOS 
MOV 
LES 


Sl.!oll 
ex.[ol + 81 
01.[01 + 4] 


If one of the buffers is always in a fixed location in memory, the address of the 
fixed buffer can be specified as immediate 
data. Consider 
the following instruction 
sequence. 


MOV 
MOV 
MOV 
MOV 
LES 


SI,AOOR$FOR$8UFFER$A 
AX. SEGAOOR$FOR$8UFFER$A 
OS,AX 
eX,[OI + 4J 
01,[011 


In the first instruction, 
the immediate data ADDR$FOR$BUFFER$A 
is moved 
into 
the 
SI 
register. 
The 
second 
instruction 
moves 
the 
immediate 
data 
SEGADDR$FOR$BUFFER$A 
into the 
AX register. 
This instruction 
is necessary 
because 
the 8086 has no instructions 
which move immediate 
data into a Segment 
register. (An exception is an inter-segment 
Jump instruction 
which loads a 16-bit seg- 
ment address into the Code Segment register.) The third instruction moves the segment 
address into the OS register. Recall that oft times, the OS register is already set to the 
desired value and no modification 
is necessary. The fourth and fifth instructions 
are 
used to load the count and destination 
buffer address into the appropriate 
registers. 
These instructions 
require that 01 point to a block of the following form: 


=========== 
} 0"'" •••,•••,,, "',,'""'''" '0"" 


=========== 
} '''~"' 
." .... ,,," 
=_-=}"o", 


Parameters 
(in this case address and count information) 
can be passed to a routine 
via the stack. For the buffer-to-buffer 
move this may be illustrated as follows: 


=========== 
} '''om ." •••• 


=========== 
} 0"'".",••,,,,""''' 'off •• 


} ,.,m."' A""•• 


============ } 0""' ." ••••,,,"''''"••,~'0". 


} 
000"' 


Figure 
4-6. 
Buffer 
Move 
Register 
Initialization 
via Stack 


and 
Pop Instructions 


This approach 
makes 
it difficult to return 
from a subroutine 
using the RET 
instruction. 
But the 8086 allows a register to supply the return address. Therefore, 


could be used instead of RET. If this approach appears intrinsically ugly, or if all the 
registers are in use, consider the following sequence: 


PUSH 
MOV 
MOV 
MOV 
MOV 
MOV 


BP 
BP.SP 
SI, IBP + 41 
OS.IBP + 6J 
01, IBP + 81 
CX.IBP + 101 


Figure 
4-7. 
Buffer 
Move 
Register 
Initialization 
via Stack 


and 
Indirect 
Addressing 


These instructions 
will perform the desired initializations. 
The routine may then 


be terminated 
with a 


MOV 
SP.BP 
POP 
BP 
RET 
8 


which will move the return 
address into the program counter 
and then add 8 to the 
adjusted 
stack pointer, 
thus removing 
parameters 
from the stack; these have been 


pushed by the called routine. 


If the buffers are present in the current data segment, the buffer addresses can be 


loaded using the LEA (Load Effective Address) 
instruction. 
The following sequence 


loads SI and 01 using the LEA instruction, 
then loads the COUNT data from memory 
into ex using the MOV instruction 


LEA 
LEA 
MOV 


SI.BUFFER$A 
OI.BUFFER$B 
CX.COUNT 


Figure 
4-8. 
Buffer 
Move 
Register 
Initialization 


using 
LEA 
Instruction 


Another 
example assumes 
that the first two bytes of BUFFER$A 
contain 
the 
number of bytes in the buffer, and therefore the number of bytes to be moved. Here is 
the resulting parameter 
block: 


=========== 
} 0"'" .""" '"''"0'" "'''" 


=========== 
} '"m,o< .""" 
'"''' 


=========== 
} 0"''' .""" '"' ,~"""'"""'''" 


If the 
DI register 
points 
to this parameter 
block, the following initialization 
sequence could be used: 


LOS 
SUOII 


MOV 
01, [01 + 41 


MOV 
CX. [SI] 


INC 
SI 
INC 
SI 


Two buffer initialization routines are shown below. The first routine replicates a 8- 
bit pattern through a buffer; the second routine replicates a 16-bit pattern through the 
buffer. Frequently such routines are used to clear a buffer, in which case the 8-bit or 16- 
bit value will be O.You will use the first routine to clear a short buffer with an odd byte 
length; you use the second routine to clear a buffer with an even byte length, or a long 
buffer with an odd byte length (use a single byte instruction to clear the odd byte). There 
are occasions when a buffer must be initialized with some non-zero pattern; for exam- 
ple, an ASCII space code might be used to initialize a buffer that is eventually to hold 
ASCII character strings. 
The routines described in Figures 4-9 and 4-10 assume that the DI register points 
to the destination 
buffer. The AL or AX register contains the 8-bit or 16-bit value to be 
replicated through the buffer. The CX register specifies the number of bytes or words in 
the buffer. 


MOV 
INC 
DEC 
JNZ 
RET 


[DI!.AL 
DI 
CX 
INITIAlIZE$LOOP 


;STORE INITIALIZING 
DATA 
;ADJUST 
POINTER 


;DECREMENT AND BRANCH 
;11not done 


MOV 


INC 
INC 
DEC 
JNZ 
RET 


[DI!.AX 
DI 
DI 
CX 
INITIAlIZE$LOOP 


The BX or SI registers can be used instead of DI in the two buffer initialization 
programs. 


Sometimes the first n bytes of a buffer are used to describe the buffer. For exam- 
ple, the total length of the buffer and displacement 
to the first empty byte might be 
stored in the first two buffer bytes. These buffer descriptive 
bytes must be adjusted 
when data is written into the buffer. 


The buffer initialization routine must itself have registers initialized, as described 
for the buffer-to-buffer 
move routines. 


In general, the address/count 
information 
is delivered to the routine in one of the 
following ways: 


• In a parameter 
block 


• On the stack 


When a buffer is translated, 
every element 
in the buffer is converted, 
using a 
translation table to make the conversion. 
The translation table provides a direct replace- 
ment value for every initial value that an element 
can have. For example, 
if a buffer 
consists of one-byte elements, 
then there are 256 possible initial values that each ele- 
ment can have, and similarly there are 256 translated 
values that the same element can 
have. The translation 
table will link each initial value to a translated 
value. Perhaps the 
most frequently seen translation table converts between ASCII and EBCDIC characters, 
each of which is encoded as a byte value. In this case, if a buffer of ASCII characters 
were translated, 
the result would be a buffer of equivalent 
EBCDIC characters. 


Consider two ways in which a buffer may be translated: 


I. 
Data within the buffer may be translated 
and left in the buffer. 


2. 
Data may be translated 
while being moved from one buffer to another. 


The 
routine 
in Figure 
4-11 translates 
data without 
moving 
it. This 
routine 
assumes that the BX register contains the address of a translation 
table, the SI register 
contains 
the address of the buffer to be translated, 
and the CX register contains 
the 
number 
of data elements 
to be translated. 


;LOAD FROM BUFFER 
;INDEX INTO TABLE 
;STORE CONVERTED DATA 
INTO BUFFER 
;POINT AT NEXT ELEMENT 
;DECREMENT AND TEST FOR DONE 


MOV 
XLAT 
MOV 
INC 
DEC 
JNZ 
RET 


ISI!.AL 
SI 
CX 
TRANSLATE$LOOP 


The routine in Figure 4-11 assumes that the element being translated maps into a 
256-byte table. This assumption 
allows the XLA T instruction 
to be used. If the element 
to be translated 
is a 16-bit data unit, a larger table may be necessary. The routine 
in 
Figure 4-12 maps 16-bit data elements into a 65K-byte table, producing an 8-bit result. 


MOV 
MOV 
MOV 
INC 
INC 
DEC 
JNZ 
RET 


DI.ISI] 
AX.IBX 
+ DII 


[SI!. AX 
SI 
SI 
CX 
TRANSLATE$LOOP 


;LOAD ELEMENT 
;USE ELEMENT AS INDEX 
;STORE RESULT 
;UPDATE 
POINTERS 


;DECREMENT AND TEST 
;FOR DONE 


TRANSlATE$lOOP: 
MOV 
XlAT 
MOV 
INC 
INC 
DEC 
JNZ 
RET 


Al,[SIl 


[DIlAl 
SI 
DI 
CX 
TRANSlATE$lOOP 


;lOAD 
ELEMENT FROM SOURCE BUFFER 


;TRANSlATE 
DATA 
;STORE CONVERTED DATA 
IN DESTINATION 
BUFFER 


;UPDATE 
POINTERS 


The routine 
in Figure 4-13 translates 
data while moving it from one buffer to 
another. This routine assumes that the BX register contains the address of the conver- 
sion table, the SI register contains 
the address of the buffer to be translated, 
the DI 


register contains the address of the buffer where the translated data will be stored, and 
the CX register contains the number 
of data elements 
to be translated, 
The routine in Figure 4-13 assumes that both of the buffers are present in the seg- 


ment addressed 
by the DS register. 


Many translation 
routines also check that all the elements in the translated buffer 


lie between specified boundary values. The routines in Figure 4-11 through 4-13 will be 
updated to include such a check later in this chapter. 


Register initialization 
for these routines 
is similar to the initialization 
methods 
used by the buffer-to-buffer 
move routines. 


The 8086 has fourteen 
16-bit registers. In most cases having so many registers is 
very desirable. But when the state of the entire machine must be saved while appropriate 
processing is performed, 
and then the state of the machine must be restored, 
an abun- 
dance of registers is less of an asset. 


When a hardware or software interrupt 
occurs, for example, 
the 8086 saves the 
contents of the Flags register, the program counter, and the Code Segment register dur- 
ing its interrupt 
acknowledge 
sequence. 
The interrupt 
service routine 
must save the 
complete state of the machine, The following instruction 
sequence will accomplish this 
task. 
There is no specific order in which the registers must be pushed; 
however the 
registers must be restored in the inverse order from which they were pushed. If registers 
are saved using the sequence illustrated in Figure 4-14, the following sequence must be 
used to restore the registers. 


PUSH ES 
PUSH DS 
PUSH SI 
PUSH DI 
PUSH BP 
PUSH DX 
PUSH ex 
PUSH BX 
PUSH AX 


POP AX 
POP BX 
POP ex 
POP DX 
POP BP 
POP DI 
POP SI 
POP DS 
POP ES 


To save the state of the entire machine requires 
11 bytes of code and 110 clock 
periods. The 110 cycles do not include time the 8086 uses to respond to the interrupt. 
For hardware interrupts 
the 8086 requires 62 clock periods to acknowledge the inter- 


rupt. For software interrupts 
the 8086 requires 51 to 53 clock periods to acknowledge 
the interrupt. 
These response cycles follow execution 
of the instruction 
during which 
the interrupt occurred. Thus, saving the state of the entire machine may take as many as 
172 clock periods, requiring 32.4 microseconds 
on a 5 MHz 8086. Restoring the state of 
the machine requires 110 clock periods, plus an additional 24 clock periods for an IRET 
(Interrupt 
Return) 
instruction. 
Therefore 
to save and then restore the 8086 machine 
state may take up to 306 clock periods, or 61.2 microseconds 
per interrupt. 


If a program is written to run in conjunction 
with an operating system, then the 
operating 
system will typically initialize segment 
registers, 
and subsequently 
modify 
their contents, as needed. If a program is written to run without the benefit of an operat- 
ing system, then the segment 
registers must be initialized. The instructions 
shown in 
Figure 4-16 will initialize the segment registers. 


MOV 
MOV 
MOV 
MOV 
MOV 
MOV 


AX,IMM$DATA$FOR$DS 
DS,AX 
AX, IMM$DA TA$FOR$ES 
ES, AX 
AX, IMM$DA TA$FOR$SS 
SS, AX 


Another way to initialize segment registers is to move data directly from memory 
into the segment registers, as shown in Figure 4-17. 


MOV 
MOV 
MOV 


DS, cs: DATA$FOR$DS 
ES, cs: DATA$FOR$ES 
SS, cs: DATA$FOR$SS 


Figure 4-17, 
Initializing the ES Register 


via Code Segment Locations 


The segment prefixes for the second and third instructions 
may be eliminated 
if 
the data for Segment Registers ES and SS are contained 
in the segment addressed 
by 
DS. 


The 8086 provides special protection for a particular initialization sequence, When 
the SS and SP registers are initialized by consecutive 
MOY instructions, 
the 8086 will 
not allow an interrupt 
to occur between the MOY instructions. 
Thus, 


MOV 
SS, cs: 
DATA$FOR$SS 
MOV 
SP, DATA$FOR$SP 


There are these five types of 8086 arithmetic 
instructions: 


1. 
Addition instructions 


3. 
Multiplication 
instructions 


4. 
Division instructions 


5. 
Compare instructions 


Each of the above categories, except for compare instructions, 
has variations that 
allow for ASCII/BCD 
operations. 


Instructions 
that perform various types of addition are shown in Table 4-2. 
Figures 4-18, 4-19, and 4-20 illustrate the use of various addition instructions. 
Each of the routines assumes that the numbers or strings to be added are present in the 
Data Segment, and are ordered as follows: 


_____ 
~ 
least significant byte 


_____ 
~ 
Most significant byte 


The routine in Figure 4-18 assumes that the SI and DI registers contain the start- 


ing addresses for the multiword numbers to be added, and the CX register contains the 
number of words to add. The result is stored in the string pointed to by the DI register. 


ClC 
MOV 
ADC 
INC 
INC 
INC 
INC 
DEC 
JNZ 
RET 


AX.[SIl 
[Dll.AX 
SI 
SI 
DI 
DI 
CX 
ADDITION$lOOP 


;ClEAR 
CARRY FOR INITIAL ADDITION 
;lOAD 
FROM INITIAL STRING 


;ADD AX TO MEMORY 
;UPDATE 
POINTERS 


String primitives and the LOOP instructions 
can reduce the number of memory 
ocations, and the time required to perform this routine. 


Status 


Mnemonic 
Operands 
Object 
Code 
Bytes 
Clocks 
Operation 
Performed 
0 
0 
I 
T 
S 
Z 
A 
P 
C 


ADC 
mem/reg" 
000100dw 
2,3 or 
reg - reg: 3 
X 
X 
X 
X 
X 
X 
[mem/reg,] 
-[mem/reg,1 
+ (mem/reg2] 
+ [CI 
mem/reg2 
mod rrr rim 
4 
mem - reg: 
Add the 8- or 16-bit 
contents 
of the memory 
location 
or 
IDISP) 
9 + EA 
memory 
register 
specified 
by mem/reg2 
and the Carry 


IDISP) 
reg - mem: 
status to the 8- or 16-bit contents 
of the memory location 
16 + EA 
or register selected by mem/reg,. 


ADC 
mem/reg, 
100000sw 
3,4,5 
reg: 4 
X 
X 
X 
X 
X 
X 
[mem/reg] 
-[mem/regl 
+ data + ICI 


data 
mod010r/m 
or6 
mem: 
Add the 8 or 16 bits of immediate 
data and the Carry 
(DISPI 
17 + EA 
status to the 8- or 16-bit contents 
of the memory location 
(DISPI 
or register 
selected 
by mem/reg. 


kk 


jjlifsw=Oll 


ADC 
ac, data 
0001010w 
2or3 
4 
X 
X 
X 
X 
X 
X 
[ac] -lac] 
+ data + [C] 
kk 
Add the 8 or 16 bits of immediate 
data and the Carry 
jj(ifw=Ol) 
status to the AL (8-bit operation) 
or AX (16-bit 
operation) 
register. 


ADD 
mem/reg" 
OOOOOOdw 
2,3 
or 
reg - reg: 3 
X 
X 
X 
X 
X 
X 
[mem/reg,] 
-[mem/reg,] 
+ [mem/reg2] 
mem/reg2 
mod rrr rim 
4 
mem- 
reg: 
Add the 8- or 16-bit 
contents 
of the memory 
location 
or 
(DISPI 
9 + EA 
register 
specified 
by mem/reg2 
to the 8- or 16-bit 
con- 
(DISPI 
reg - mem: 
tents 
of 
the 
memory 
location 
or register 
selected 
by 
16 + EA 
mem/reg,. 


ADD 
mem/reg, 
100000sw 
3,4,5 
reg: 4 
X 
X 
X 
X 
X 
X 
Imem/regJ 
-(mem/regJ 
+ data 
data 
mod 000 rim 
or6 
mem: 
Add the 8 or 16 bits of immediate 
data to the 8 or 16 bit 
(DISP) 
17 + EA 
contents 
of the memory 
location 
or register 
selected 
by 
(DISP) 
mem/reg. 
kk 
jj lif sw = Oll 


ADD 
aC,data 
0000010w 
20r3 
4 
X 
X 
X 
X 
X 
X 
[acl - 
[ac] + data 


kk 
Add the 8 or 16 bits of immediate 
data to the AL 18-bit 


jj lifw 
= II 
operation) 
or AX (1 6-bit 
operation) 
register. 


INC 
mem/reg 
1111111w 
2,3 
or 
reg: 3 
X 
X 
X 
X 
X 
[mem/regl 
- 
[mem/regJ 
+ 1 
mod 000 rim 
4 
mem: 
Increment 
by 1 the 8 or 16 bit contents 
of the memory 
(DISPI 
15 +EA 
location 
or register selected 
by mem/reg. 


(DISPI 


Status 
Mnemonic 
Operands 
Object 
Code 
Bytes 
Clocks 
Operation 
Performed 


0 
0 
I 
T 
S 
Z 
A 
P 
C 


INC 
reg 
01000rrr 
1 
2 
X 
X 
X 
X 
X 
[reg] - 
[reg] + 1 


Increment 
by 
1 
the 
16-bit 
contents 
of 
the 
specified 
register. 


AAA 
37 
1 
4 
? 
? 
? 
X 
] 
X 
ASCII adjust the contents of the AL register after an addi- 
tion. 


DAA 
27 
1 
4 
? 
X 
X 
X 
X 
X 
Decimal adjust the contents 
of the AL register after an 
addition. 


The routine in Figure 4-19 assumes that the SI and Dl registers contain the start- 
ing addresses for the BCD strings to be added, and the CX register contains the number 
of BCD bytes in each BCD string. The result is stored in the string pointed to by the DI 
register. 


START: 
BCD$ADDITION$LOOP: 
AL.ISI! 
AL,[DI! 


;CLEAR CARRY FOR INITIAL ADDITION 
;LOAD FROM STRING A 
;ADD FROM STRING B 
;PERFORM BCD ADJUST 
;STORE RESULT 
;UPDATE 
POINTERS 


CLC 
MOV 
ADC 
DAA 
MOV 
INC 
INC 
DEC 
JNZ 
RET 


IDll.AL 
SI 
DI 
CX 
BCD$ADDITION$LOOP 


;DECREMENT AND TEST 
;FOR DONE 


The routine in Figure 4-20 assumes that the SI and Dl registers contain the start- 
ing addresses for two ASCll strings that are to be added. The CX register contains the 
number of ASCll bytes in each string. The result will be stored in the string pointed to 
by the DI register. 


CLC 
MOV 
ADC 
AAA 
MOV 
INC 
INC 
DEC 
JNZ 
RET 


AL.ISI! 
AL,[DI] 


IDI!,AL 
SI 
DI 
cx 
ASCII$ADDITION$LOOP 


;LOAD FROM STRING A 
;ADD STRING B 
;PERFORM AN ADJUST 
;STORE RESULT 
;ADJUST 
POINTERS 


;DECREMENT AND TEST 
;FOR DONE 


The routines in Figures 4-19 and 4-20 can both use the string primitives to reduce 
the number 
of bytes and the amount of time required to perform these operations. 


For the above addition routines, consider the case where the numbers to be added 
have the following format: 


In this case, addition routines would differ from Figures 4-18 through 4-20 in two 
major respects: 


1. 
The initialization 
sequence 
would differ. The initialization 
sequences 
would 
point registers at the last byte of the multibyte number, 
instead of the first. 


2. 
The pointers would be decremented, 
not incremented. 


To account for these differences, 
modified starting addresses must be loaded into 
appropriate 
address registers. Subsequently 
addresses 
must be decremented. 


Subtraction 
instructions 
are shown in Table 4-3. 
The subtraction 
versions of the multibyte addition routines presented in the pre- 
vious section are easily derived. Creation of these routines 
is left to the reader as an 
exercise. 


Status 
Mnemonic 
Operands 
Object 
Code 
Bytes 
Clocks 
Operation 
Performed 
0 
0 
I 
T 
S 
Z 
A 
P 
C 


SUB 
mem/reg" 
001010dw 
2,3 or 
reg - reg: 3 
X 
X 
X 
X 
X 
X 
Imem/reg,] 
- 
[mem/reg,1 
- [mem/reg2] 


mem/reg2 
mod rrr rim 
4 
mem - reg: 
Subtract 
the 8 or 16 bit contents 
of the memory 
locatic 
(DISP) 
9 + EA 
or register 
specified 
by mem/reg2 
from 
the 8 or 16 b 


(DISP) 
reg· mem: 
contents 
of the memory 
location 
or register specified 
t 


16 + EA 
mem/reg,. 


SUB 
mem/reg 
100000sw 
3,4,5 
reg: 4 
X 
X 
X 
X 
X 
X 
Imem/reg] 
- 
Imem/reg] 
- 
data 


data 
mod 101 rim 
or6 
mem: 
Subtract 
8 or 16 bits of immediate 
data from the 8 or 1 
(DISPI 
17 + EA 
bit contents 
of the memory 
location 
or register specifie 


(DISP) 
by mem/reg. 


kk 
jj lifsw = 01) 


SU8 
aC,data 
0010110w 
2 or 3 
4 
X 
X 
X 
X 
X 
X 
[acl - 
[acl - 
data 
kk 
Subtract 8 or 16 bits of immediate 
data from the AL (8-b 
jj (ifw = 1) 
operation) 
or AX l16-bit 
operation) 
register. 


S8B 
mem/reg" 
000110dw 
2,3 
or 
reg - reg: 3 
X 
X 
X 
X 
X 
X 
[mem/reg, 
) - 
Imem/reg,] 
- [mem/reg2] 
- 
[C] 
mem/reg2 
mod rrr rim 
4 
mem - reg: 
Subtract 
the 8 or 16 bit contents 
of the memory 
locatio 


IDISP) 
9 + EA 
or register 
specified 
by mem/reg2 
and the Carry statu 


IDISPI 
reg -mem: 
from 
the 8 or 16 bit contents 
of the memory 
location 
( 


16 + EA 
register specified 
by mem/reg. 


S8B 
mem/reg, 
100000sw 
3,4,5 
reg: 4 
X 
X 
X 
X 
X 
X 
[mem/regJ 
- 
Imem/reg] 
- 
data - 
IC] 


data 
mod011 
rim 
or 6 
mem: 
Subtract the 8 or 16 bits of immediate 
data and the Carr 
(DISP) 
17 + EA 
status from the 8- or 16-bit contents 
of the memory locs 


(DISP) 
tion specified 
by mem/reg,. 


kk 
jj lif sw = 01) 


S88 
aC,data 
0001110w 
2 or 3 
4 
X 
X 
X 
X 
X 
X 
lac] - 
[acl - 
data - 
IC] 
kk 
Subtract 
the 8 or 16 bits of immediate 
data and the Carr 


jjlifw 
= 11 
status from the AL 18-bit operation) 
or AX (16-bit 
opera 


tion) register. 


DEC 
mem/reg 
1111111w 
2,3 or 
reg: 
3 
X 
X 
X 
X 
X 
[mem/regl 
- 
Imem/reg] 
- 
1 
mod 00' 
rim 
4 
mem: 
Decrement 
by 1 the 8 or 16 bit contents 
of the memor 
(DISPI 
15 + EA 
location 
or register 
selected 
by mem/reg. 
(DISPI 


Status 
Mnemonic 
Operands 
Object 
Code 
Bytes 
Clocks 
Operation 
Performed 
0 
0 
I 
T 
S 
Z 
A 
P 
C 


DEC 
reg 
01001rrr 
1 
2 
X 
X 
X 
X 
X 
[regl - 
[reg] - 
1 
Decrement by 
1 the 
16 bit contents 
of the specified 
register. 


AAS 
3F 
1 
4 
? 
? 
? 
X 
? 
X 
ASCII adjust 
the contents 
of 
the 
AL register 
after 
a 
subtraction. 


DAS 
2F 
1 
4 
? 
X 
X 
X 
X 
X 
Decimal adjust the contents 
of the AL register after a 
subtraction. 


NEG 
memlreg 
1111011w 
2,3 or 
reg: 3 
X 
X 
X 
X 
X 
X 
[reg] - 
[reg] + 1 
mod011 
rim 
4 
mem: 
Twos complement the 8 or 16 bit contents of the memory 
(DISP) 
16 + EA 
location or register specified by mem/reg. 
(DISP) 


The routines in Figures 4-21 and 4-22 illustrate typical uses of8086 multiplication 
instructions. 


The routine in Figure 4-21 multiplies two 32-bit unsigned numbers, 
generating a 
64-bit result. This routine operates on a data block having the following form. 


MOV 
AX, [BXI 
;MUL TIPLY LOW-ORDER 
16 BITS 
MUL 
[BX + 4] 
;BY LOW-ORDER 
16 BITS 


MOV 
[BX + 8J.AX 
;SAVE RESULT, WHICH IS IN AX 


MOV 
[BX + 10J.DX 
;AND DX 


MOV 
AX, [BXI 
;MUL TIPL Y LOW-ORDER 
16 BITS OF 
;OPERAND A BY HIGH-ORDER 16 BITS 


MUL 
[BX + 61 
;OF OPERAND B 


ADD 
[BX + 10J.AX 
;ADD TO PREVIOUS RESULT 
ADC 
[BX + 121.DX 
;ASSUME 
RESULT BYTES 
JNC 
NEXT$MUL 
;ARE INITIALLY ZERO 
INC 
[BX + 141 


MOV 
AX,[BX 
+ 21 
;MUL TIPL Y HIGH-ORDER 16 BITS OF 
;OPERAND A BY LOW-ORDER 
16 BITS 


MUL 
[BX + 4] 
;OF OPERAND B 


ADD 
[BX + 10J.AX 
;ADD TO PREVIOUS RESULT 
ADC 
[BX + 121.DX 
INC 
HIGH$ORDER$MUL 
INC 
[BX + 141 
;SAVE CARRY 


MOV 
AX, [BX + 21 
;MUL TIPL Y HIGH-ORDER 16 BITS 
;OF OPERAND A BY HIGH-ORDER 16 


MUL 
[BX + 61 
;BITS OF OPERAND B 


ADD 
[BX + 12J.AX 
;ADD TO PREVIOUS RESULT 
ADC 
[BX + 14J.DX 
;ADD TO PREVIOUS RESULT 
RET 


The routine in Figure 4-22 multiplies an ASCII string by a single ASCII digit. The 
result is a string of unpacked BCD digits, The routine assumes that the ASCII string is 
organized in the following manner. 


Status 
Mnemonic 
Operands 
Object Code 
Bytes 
Clocks 
Operation 
Performed 
0 
0 
I 
T 
S 
Z 
A 
P 
C 


MUL 
reg 
11110110 
2 
70 - 
77 
X 
] 
] 
] 
] 
X 
If w = o. [Axi 
- 
[All' 
Imem/regl 


IS-bltl 
11100 reg 
if w = 1. [OX] [AX] - 
[AX] • [mem/reg] 


reg 
11110111 
2 
11S-133 
Multiply the S- or 16-bit contents of the memory location 


116-bltl 
11100 reg 
or register specified by mem/reg 
with the contents of the 


mem 
11110110 
2.3 
or 
176 - 
S31 
AL IS-bit operationl or AX 116-bit operationl register. The 


IS-bltl 
mod 100 r m 
4 
+ EA 
result is stored in the AX register. in the case of an SXS -bit 


10lSPI 
operation. or the OX register Ihigh-order 
16 bits I and the 


10lSPI 
AX register (low-order 16 bitsl in the case of a 16 x 16-bit 


mem 
11110111 
2.3 or 
1124-139 
operation. This is an unsigned multiplication operation. 


116-bltl 
mod 100 rim 
4 
+ EA 
The execution 
time 
may vary 
by 
7 clocks 
for 
S-bit 


10lSPI 
operands and 15 clocks for 16-bit operands. 


10lSPI 


IMUL 
reg 
11110110 
2 
SO - 
9S 
X 
] 
] 
] 
? 
X 
if w = o. [AX] - 
[All· 
Imem/reg] 
IS-bitl 
11101 reg 
if w = 1. [OX] [AX] - 
[AX] • [mem/reg] 


reg 
11110111 
2 
12S-154 
Multiply the S- or 16-bit contents of the memory location 


116-bitl 
11101 
reg 
or register specified by mem/reg 
with the contents of the 


mem 
11110110 
2.3 
or 
IS6-1041 
AL IS-bit operationl or AX 116-bit operationl register. The 


IS-bitl 
mod 101 r m 
4 
+ EA 
result is stored in the AX register. in the case of an S x S- 


10lSPI 
bit operation. or the OX register Ihigh-order 16 bits) and 


10lSPI 
the AX register (low-order 16 bitsl in the case of a 16 x 


mem 
11110111 
2.3 
or 
1134-1601 


16-bit operation. This is a signed multiplication operation. 


116-bitl 
mod 101 rim 
4 
+ EA 
The execution time 
may vary by 
1S clocks 
for 
S-bit 


10lSPI 
operands and 26 clocks for 16-bit operands. The variation 


10lSPI 
is data-dependent 


AAM 
04 
2 
S3 
] 
X 
X 
? 
X 
? 
After multiplying two unpacked decimal operands. adjust 
OA 
the product in AX to become an unpacked decimal result 


The routine further assumes that the SI register points at the ASCII string, the DL 
register 
contains 
the multiplier, 
a single ASCII digit, the 
DI register 
points to the 
memory locations where the result, a BCD string, will be stored, and the CX register 
contains the number of digits in the multiplicand. 
The result stored in the BCD string 
will have the following form. 


Byte *0 
1... 
_ 


MOV 
AND 


MOV 
INC 
AND 
MUL 
AAM 
ADD 
AAA 
MOV 
INC 
MOV 
DEC 
JNZ 
RET 


(01).0 
DL,OFH 


AL.ISI) 
SI 
AL,OFH 
DL 


;CLEAR INITIAL BYTE OF BCD STRING 
;AND OFF BITS 4 AND 5 OF MULTIPLIER 


;LOAD MULTIPLICAND 


;CLEAR UPPER NIBBLE 
;MUL TIPLY BCD· 
BCD 


;ADJUST 
RESULT 


;ADD IN BCD 


[DI],AL 


01 
[DI],AH 
CX 
MUL TIPL Y$NEXT$BYTE 


The routine in Figure 4-23 divides a string of ASCII digits by a single ASCII digit. 


The result is a string of BCD digits. The routine 
assumes 
that the ASCII string is 
organized in the following manner. 


The SI register points to the ASCII string, the DL register contains the divisor, a 
single ASCII digit, the DI register points to the memory locations where the result, a 
BCD string, will be stored, and the CX register contains the number of digits in the divi- 
dend. The result, stored in the BCD string, will be of the following form. 


Byte *0 
1 
_ 


_____ 
1 
Low-order 
byte 


Status 
Mnemonic 
Operands 
Object 
Code 
Bytes 
Clocks 
Operation 
Performed 
0 
0 
I 
T 
S 
Z 
A 
P 
C 


DIV 
reg 
11110110 
2 
80 - 
90 
7 
? 
7 
? 
7 
7 
if w = 0, ~[AH] remaindeJ -[AXl![mem/reg] 
18-bitl 
11110 
reg 
[AL] quotient 


reg 
11110111 
2 
144-162 
if w = 1,~DX] remaindeJ - 
(DXI/IAX] 
Imem/reg] 


116-bitl 
11110 
reg 
lAX] quotient 


mem 
11110110 
2,3 
or 
186 - 
96) 
Divide the AX register, in the case of a 16-bit operation, or 


(8-bit) 
mod 110 rIm 
4 
+ EA 
the DX register Ihigh-order 16 bits) and AX (low-order 
16 
(DISP) 
bitsl, in the case of a 32-bit operation, by the 8- or 16-bit 
(DISPI 
contents of the memory location or register specified by 


mem 
11110111 
2,3 
or 
(150-168) 
mem/reg. 
In the case of a 16 x 8-bit division, the quotient 


(16-bit) 
mod 110 rIm 
4 
+ EA 
is placed in AL, and the remainder is stored in AH. In the 


(DISPI 
case of a 32 x 16-bit division, the quotient is placed in the 


(DISP) 
AX register, and the remainder is placed in the DX register. 


This is an unsigned division operation. 


The execution 
time 
may vary by 
10 clocks 
for 
8-bit 
operands and 18 clocks for 16-bit operands. The variation 
is data dependent. 


IDIV 
reg 
11110110 
2 
101-112 
? 
? 
? 
? 
? 
? 
if w = 0, [AH] remaindej 
-(AXl!(mem/reg] 
18-bit) 
11111 
reg 
IAL] quotient 


reg 
11110111 
2 
165-184 
if w = 1, [DX] 
remaindeJ -[DX] 
[AX]/lmem/reg] 
(16-bit) 
11111 
reg 
[AX] quotient 


mem 
11110110 
2,3 
or 
(107-118 
Divide the AX register, in the case of a 16-bit operation, or 
(8-bit) 
mod 111 rIm 
4 
+ EA 
the DX register (high-order 16 bits) and AX (low-order 
16 
(DISP) 
bits), in the case of a 32-bit operation, by the 8- or 16-bit 
(DISP) 
contents of the memory location or register specified by 


mem 
11110111 
2,3 or 
1171-190) 
mem/reg. In the case of a 16 x 8-bit division, the quotient 


(16-bit) 
mod 111 rIm 
4 
+ EA 
is placed in AL, and the remainder is stored in AH. In the 


(DISP) 
case of a 32 x 16-bit division, the quotient is placed in the 


(DISP) 
AX register, and the remainder is placed in the DX register. 


This is a signed division operation. 


The execution 
time 
may vary by 
11 clocks 
for 
8-bit 
operands and 19 clocks for 16-bit operands. The variation 
is data dependent. 


Status 
Mnemonic 
Operands 
Object 
Code 
Bytes 
Clocks 
Operation 
Performed 
0 
D 
I 
T 
S 
Z 
A 
P 
C 


CBW 
98 
1 
2 
[AH) -[AL7) 


Extend the Si9n bit of the AL register, bit 7, into the AH 
register. 


CWD 
99 
1 
5 
[DX) -[AX15) 


Extend the sign bit of the AX register, bit 15, into the DX 
register. 


AAD 
D5 
2 
60 
? 
X 
X 
? 
X 
? 
Decimal 
adjust 
dividend 
in 
AL 
prior 
to 
dividing 
an 
OA 
unpacked 
decimal 
divisor, 
to 
generate 
an 
unpacked 
decimal quotient. 


AND 
XOR 


MOV 
INC 


AND 
AAD 
DIV 


MOV 
INC 


DEC 
JNZ 
RET 


DL,OFH 
AH,AH 


AL,ISI] 
SI 


AL,OFH 


;CLEAR HIGH-ORDER NIBBLE 
;CLEAR AH 


;LOAD BYTE FROM ASCII STRING 


;CLEAR BITS 4 AND 5 
;ADJUST 
USING AH 
DL 


[Dll.AL 


01 


CX 
DIVIDE$NEXT$BYTE 


Dividing a 64-bit dividend by a 32-bit divisor is not an easy task on the 8086. The 
DIY and IDlY instructions 
are not particularly useful when performing this function. To 
divide a 64-bit number 
by a 32-bit number, 
a subtract 
and shift algorithm 
must be 
employed. The construction 
of such a routine, a nontrivial task, is beyond the scope of 
the current discussion. 


8086 compare 
instructions 
are shown in Table 4-6; they execute 
like subtract 
instructions, 
however 
no result is returned 
to a register 
or memory 
location. 
The 
subtract operation is used only to set status flags. 
Use of compare instructions 
is illustrated in Figures 4-24 through 4-26. 
Two string primitive instructions, 
CMPS and SCAS, also perform comparisons. 


rrhese 
instructions 
are discussed 
with the other 
primitive 
instructions 
later in the 
chapter. 


Status 
Mnemonic 
Operands 
Object 
Code 
Bytes 
Clocks 
Operation 
Performed 
0 
0 
I 
T 
S 
Z 
A 
P 
C 


CMP 
mem/reg" 
0Oll10dw 
2,3 or 
reg-reg: 3 
X 
X 
X 
X 
X 
X 
[mem/reg, 
1 - 
Imem/reg2l 
mem/reg2 
mod rrr rim 
4 
mem-reg: 
Subtract the 8- or 16-bit contents 
of the memory location 
(DISP) 
9 + EA 
or register 
selected 
by mem/reg2 
from 
the 8- or 16-bit 
(DISPI 
reg-mem: 
contents 
of the memory 
location 
or register 
specified 
by 
9 + EA 
mem/reg" 
use the result to set the flags, then discard the 


result. 


CMP 
mem/reg, 
100000sw 
3.4,5 
reg: 4 
X 
X 
X 
X 
X 
X 
lmem/regl 
- 
data 
data 
mod 111 rim 
or6 
mem: 
Subtract the 8 or 16 bits of immediate data from the 8- or 
(DISP) 
10"+ EA 
16-bit 
contents 
of 
the 
memory 
location 
or 
register 
(DISP) 
specified 
by mem/reg, 
use the result to set the flags, then 
kk 
discard the result. 


jj (if sw = 01) 


CMP 
aC,data 
0011110w 
2 or 3 
4 
X 
X 
X 
X 
X 
X 
lacl - 
data 


kk 
Subtract 
the 8 or 16 bits of immediate 
data from the AL 
li (ifw = 1) 
(8-bit operation) 
or the AX (16-bit 
operation) 
register, use 
the result to set the flags, then discard the result. 


The routine in Figure 4-24 determines 
the number 
of a characters in a string. 
This routine assumes that the SI register addresses the string being scanned and 
AH contains a character that identifies the end of the string. When this routine finishes 
executing, the DX register will contain the number of characters between the start of the 
string and the terminating 
character. 


MOV 


INC 
MOV 
INC 
CMP 
JNZ 
RET 


DX,OFFFFH 


DX 
AL,ISI) 
SI 
AH,AL 
SCAN$FOR$DELIMITER 


;INITIALIZE COUNT TO -1 


;INCREMENT COUNT 
;LOAD BYTE FROM STRING 
;UPDATE 
POINTER 


;COMPARE 
WITH TERMINATION 
;BRANCH IF NOT TERMINATION 


The string primitive 
instruction 
SCAS can be used to reduce 
the amount 
of 
memory and speed of execution of this routine. The SCAS instruction 
is discussed with 
the other string primitive instructions 
later in this chapter. 


The routine in Figure 4-25 will determine 
the largest 8-bit unsigned number in a 
sequence of 8-bit unsigned numbers, This routine assumes that the SI register addresses 
the sequence of numbers 
to be scanned, while the CX register contains the number 
of 
bytes to be scanned, When this routine has finished executing, AH will contain the max- 
imum value, and DX will point at the maximum 
value. 


XOR 


MOV 
CMP 
JAE 
MOV 
MOV 
INC 
DEC 
JNZ 
RET 


AH.AH 


AL,ISI) 
AH.AL 
UPDATE$PTR 
AH,AL 
DX,SI 
SI 
CX 
SCAN$NEXT$BYTE 


;INITIALIZE MAX. NUMBER 


;LOAD BYTE FROM SE~UENCE 
;COMPARE WITH CURRENT MAX. 
'#0 


;SAVE NEW MAX. NUMBER 
;SAVE LOCATION 
OF MAX. 
'#0 


The routine in Figure 4-25 and the routine in Figure 4-26 can be improved 
by 
using string primitive instructions. 


The routine in Figure 4-26 will determine 
the largest l6-bit signed number 
in a 
sequence of 16-bit signed numbers. This routine assumes that the SI register addresses 
the series of numbers to be scanned while the CX register contains the number of words 
to be scanned. 


MOV 
MOV 
CMP 
JGE 
MOV 
MOV 
INC 
INC 
DEC 
JNZ 
RET 


BX.8000H 
AX,(SIl 
BX,AX 
UPDATE$PTR 
BX,AX 
DX,SI 
SI 
SI 
CX 
SCAN$LOOP 


;INITIALIZE MAX. NUMBER 
;LOAD NUMBER FROM SEQUENCE 
;COMPARE WITH 
CURRENT MAX. NUMBER 


;SAVE NEW MAX. NUMBER 
;SAVE LOCATION 
OF MAX. NUMBER 
;UPDATE 
PRT. 


Earlier in this chapter, two buffer translation routines were presented. The follow- 
ing routine includes error checking. The characters 
in the buffer which is to be trans- 
lated must lie in the range 2016 
~ 
character 
~ 5F16, This routine assumes that the BX 
register contains the address of the conversion table, the SI register contains the address 
of the buffer to be translated, 
and the CX register contains the number of data elements 
to be translated. 


MOV 
SUB 
JB 
CMP 
JA 
XLAT 
MOV 
INC 
DEC 
JNZ 
RET 
RET 


AL,(SI) 
AL,20H 
TRANSLATE$ERROR 
AL.3FH 
TRANSLATE$ERROR 


;LOAD BYTE FROM SOURCE 
;NORMALIZE 
;IF LESS THAN 0, REPORT ERROR 
;COMPARE 
WITH NORMALIZED 
MAX. 
;IF GREATER. REPORT ERROR 
;TRANSLATE 
NORMALIZED 
VALUE 


;STORE CONVERTED DATA 
;ADJUST 
POINTERS 
(SI],AL 
SI 
CX 
TRANSLA TE$LOOP 


;GOOD RETURN WITH 
Z=1 
;ERROR RETURN WITH 
Z=O 


The routine returns with Z= 1 if there were no translation errors and Z=O for one 
or more translation 
errors. Note that the subtraction 
instruction 
limits the size of the 
conversion table to 4016 bytes. Data could be validated using two CM? instructions 
with 
the BX register addressing a location 2016 bytes before the conversion table. 


AND 
NOT 
OR 
XOR 


In addition, 
the TEST instruction 
performs an AND operation 
without altering 
either of the operands. 
The 8086 logical instructions 
are shown in Table 4-7. 


Status 
Mnemonic 
Operands 
Object 
Code 
Bytes 
Clocks 
Operation 
Performed 
0 
0 
I 
T 
S 
Z 
A 
P 
C 


AND 
mem/reg" 
001000dw 
2,3, or 
reg-reg: 3 
X 
X 
X 
? 
X 
X 
[mem/reg,)- 
[mem/reg,) 
AND [mem/reg21 


mem/reg2 
mod rrr rim 
4 
mem-reg: 
AND the 8- or 16-bit contents of the memory location or 
IDISP) 
9 + EA 
register selected by mem/reg2 with the 8- or 16-bit con- 
IDISP) 
reg-mem: 
tents 
of the 
memory 
location 
or register 
specified 
by 
16 + EA 
mem/reg" 
leaving the result in the memory location or 
register specified by mem/reg,. 


AND 
mem/reg, 
1000000w 
3.4,5 
reg: 4 
X 
X 
X 
? 
X 
X 
Imem/regl 
- 
Imem/reg) AND data 
data 
mod 100r/m 
or 6 
mem: 
AND the 8 or 16 bits of immediate data with the 8- or 16- 


IDISP) 
17 + EA 
bit contents of the memory location or register specified 
IDISP) 
by mem/reg, storing the result in the memory location or 
kk 
register specified by mem/reg. 


jjlifw= 
11 


AND 
aC,data 
0010010w 
20r3 
4 
X 
X 
X 
? 
X 
X 
lac) - 
lacl AND data 
kk 
AND the 8 or 16 bits of immediate data with the AL (8-bit 
jj lif w = 1) 
operation) 
or the AX (16-bit 
operationl 
register, leaving 
the result in the AL or AX register. 


NOT 
mem/reg 
1111011w 
2,3 or 
reg: 3 
Imem/regl 
- 
Imem/regJ 
modOl0r/m 
4 
mem: 
Ones complement 
the 
8- 
or 
16-bit 
contents 
of 
the 
IDISP) 
16 + EA 
memory location or register specified by mem/reg. 


IDISP) 


OR 
mem/reg, , 
0OOO10dw 
2,3 or 
reg-reg: 3 
X 
X 
X 
? 
X 
X 
Imem/reg,) 
-[mem/reg, 
J OR Imem/reg21 
mem/reg2 
mod rrr rim 
4 
mem-reg: 
OR the 8- or 16-bit 
contents of the memory location or 
IDISP) 
9 + EA 
register specified by mem/reg2 
with the 8- or 16-bit con- 
IDISP) 
reg-mem: 
tents 
of 
the 
memory 
location 
or register 
selected 
by 
16 + EA 
mem/reg" 
leaving the result in the memory location or 
register selected by mem/reg,. 


OR 
mem/reg, 
1000000w 
3.4,5 
reg: 4 
X 
X 
X 
? 
X 
X 
Imem/regl 
- 
Imem/reg) 
OR data 
data 
mod001 
rim 
or 6 
mem: 
OR the 8 or 16 bits of immediate data with the 8- or 16- 
(DISP) 
17 + EA 
bit contents of the memory location or register specified 
(DISPI 
by mem/reg, 
leaving the result in the memory location or 


kk 
register specified by mem/reg. 


jj (if w = 1) 


Status 
Mnemonic 
Operands 
Object 
Code 
Bytes 
Clocks 
Operation 
Performed 
0 
0 
I 
T 
S 
Z 
A 
P 
C 


OR 
ac.data 
0000110w 
2 or 3 
4 
X 
X 
X 
7 
X 
X 
lac] - 
lacl OR data 
kk 
OR the 8 or 16 bits of immediate data with the AL 18-bit 
~ lifw 
= 11 
operationl 
or AX 116-bit operation) 
register, leaving the 
result in the AL or AX register. 


TEST 
mem/reg" 
1000010w 
2.3 or 
reg-reg: 3 
X 
X 
X 
? 
X 
X 
Imem/reg,] 
AND Imem/reg2) 
mem/reg2 
mod rrr rIm 
4 
mem-reg: 
AND the 8- or 16-bit contents of the memory location or 
(DISP) 
g + EA 
register specified by mem/reg2 with the 8- or 16-bit con- 


(DISPI 
tents 
of the 
memory 
location 
or register 
specified 
by 
mem/reg,. 
using the result to set the flags. then discard- 
ing the result. 


TEST 
mem/reg. 
1111011w 
3.4.5 
reg: 5 
X 
X 
X 
? 
X 
X 
Imem/reg] 
AND data 
data 
mod 000 rim 
or6 
mem: 
AND the 8 or 16 bits of immediate data with the 8- or 16- 
(DISP) 
11 + EA 
bit contents of the memory location or register specified 
(DISP) 
by mem/reg, 
using the result to set the flags. then dis- 


kk 
carding the result. 


jjlifw= 
1) 


TEST 
aC,data 
1010100w 
2 or 3 
4 
X 
X 
X 
? 
X 
X 
lac] AND data 
kk 
AND the 8 or 16 bits of immediate data with the AL (8-bit 
~ (if w = 1) 
operation) 
or AX 
(16-bit 
operation) 
register, 
using the 
result to set the flags, then discarding the result. 


XOR 
mem/reg" 
001100dw 
2,3 or 
reg-reg: 3 
X 
X 
X 
7 
X 
X 
Imem/reg,]- 
[mem/reg,] 
XOR Imem/reg2] 
mem/reg2 
mod rrr rIm 
4 
mem-reg: 
Exclusive-OR the 8- or 16-bit 
contents 
of the memory 
(DISPI 
9 + EA 
location or register specified by mem/reg2 with the 8- or 
(DISPI 
reg-mem: 
16-bit 
contents 
of 
the 
memory 
location 
or 
register 
16 + EA 
specified by mem/reg" 
leaving the result in the memory 
location or register specified by mem/reg,. 


XOR 
mem/reg, 
1000000w 
3,4,5 
reg: 4 
X 
X 
X 
7 
X 
X 
Imem/reg] 
-Imem/reg] 
XOR data 
data 
mod 110 rIm 
or 6 
mem: 
Exclusive-OR the 8 or 16 bits of immediate data with the 
(DISP) 
17 + EA 
8- or 16-bit contents of the memory location or register 
(DISPI 
specified 
by mem/reg, 
leaving the result in the memory 


kk 
location or register specified by mem/reg. 


jj (if w = 1) 


XOR 
ac.data 
0011010w 
2 or 3 
4 
X 
X 
X 
7 
X 
X 
lac] - 
lacl XOR data 
kk 
Exclusive-OR the 8 or 16 bits of immediate data with the AL 
~ lif w = 1) 
(8-bit operation) 
or AX (16-bit 
operation) 
register, leaving 
the result in the AL or AX reaister. 


The routine 
in Figure 4-29 illustrates 
8086 logical instructions. 
Consider 
the 
following scenario; 


1. 
An I/O port is receiving a stream of data blocks. These data blocks are in Sig- 
netics hex code. 


The routine in Figure 4-29 is an interrupt service routine that handles the above 
scenario via the following steps: 


1. 
Save the raw information. 


2. 
Convert the raw information 
into object code. 


3. 
Check the checksum. 


4. 
Set a message "message 
completed" 
bit when processing is complete. 


The 
routine 
in Figure 
4-29 performs 
these 
functions 
by implementing 
the 
flowchart in Figure 4-28. 


As illustrated below, the Signetics object code format has the following elements. 


1. 
A gap having any number 
of non-printing 
characters, 
including spaces 


2. 
Start of block character: 
a colon 


3. 
Address field: four hex characters 


4. 
Count field: two hex characters in range 0 to IE 


5. 
Block Control Character for address and count fields: two hex characters 


6. 
Data field: twice the value in the count field which is the number of memory 
locations loaded by the current block 


7. 
Block Control Character: 
two hex characters 
~!~~ 
0455B024Fl01 
F050400 
~ 


@ - Start of block character (colon) 
® - Starting address for block (H'0500') 
@ - Number of bytes in block (H'OA' = 10) 
® - BCC 
byte for fields 3 and 4 IH'3C') 
® - Data. two characters per byte 
(2) - 
BCC 
byte for field 6 (H'30') 


The Block Control 
Character, 
otherwise 
known as a checksum, 
is created 
by 
manipulating 
the data bytes within a data string. The Signetics object code format 
includes two Block Control Characters. The first Block Control Character 
@ applies to 
the three preceding data bytes, which contain the starting address for the block and the 
number 
of bytes in the block. The second Block Control Character 
(j) applies to the 
string of data bytes in field ® 


To generate 
a Block Control 
Character, 
the 
character 
is first cleared, 
then 
repetitively 
exclusive-ORed 
with each data 
byte in the 
related 
string. 
After 
each 
exclusive-OR 
step, 
the 
result 
is left rotated 
one bit. To illustrate 
Block Control 
Character logic, consider the first Block Control Character in the Signetics object format 
illustration; 
it is generated 
from the byte sequence 
05 00 OA as follows: 


BCC 
Data 
BCC XOR Data 
Left Rotate 


00000000 
00000101 
00000101 
00001010 


00001010 
00000000 
00001010 
00010100 


00010100 
00001010 
00011110 
00111100 
BCC = 3CH 


This routine 
uses four variables in memory, 
in addition 
to the buffer written 


which the data is stored. The variables are: 


STATUS$BYTE: 


6 


{ 


START$OF$MESSAGE$8IT 
o - Message 
not started 
1 - Either Header or Data Message started 


{ 


HEADER$DATA$8IT 
o - Data being processed 
1 - Header being processed 


{ 


MESSAGE$COMPLETED$8IT 
o - Message 
not done 
1 - Message 
done 


{ 


TRANSLA 
TION$ERROR$8IT 
0- 
No error 


1 - Translation 
Error 


{ 


HEADER$CHECKSUM$ERROR 
$81T 
0- 
No error 
1 - Header Checksum 
Error 


{ 


DATA$CHECKSUM$ERROR$8IT 
o - No error 
1 - Data Checksum 
Error 


One byte which contains the number 
of characters 
left 


to be read in either the header or the data block. The 
start 
of 
message 
code 
initializes 
this 
variables 
to 


NUMBER$OF$HEADER$CHARACTERS, 
which 
is 


equated to 8. When the header has been processed, this 
variable is initialized to 2* (Number of object code bytes 
in data block) +2. 


One byte which contains 
the number 
of object code 


bytes in the data block. This variable is initialized after 
the header has been processed. 


A 16-bit offset address which indicates where the next 
byte of data from the I/O port should be stored. This 
variable 
is initialized 
by the Start of Message 
code, 


which loads immediate 
data into BUFFER$POINTER. 


This assumes 
that the buffer will always be in a fixed 


position. If it is necessary 
to vary the location of the 


buffer, this variable could be initialized by a system or 
user program. 


STATUS 
BYTE 
EQUATES 


ST ARTSOFSMESSAGESBIT 
HEAOERSDAT 
ASBIT 


MESSAGESCOMPLETEDSBIT 
TRANSLA 
liON 
$ERRORSBIT 
HEADER SCHECKS 
UMSERRORSBIT 


OAT ASCHECKSUMSERRORSBIT 


BUFFER 
ADDRESS 
EQUATES 


BUFFERSAODRESS 
ST ARTSOFSHEADERSPOINTER 
ST ART$QF$OAT 
A$POINTER 


10 
EQUATE 


CHARACTER$PORT 


MISCELLANEOUS 
EQUATES 


S T ARTSOFSMESSAGESCHARAC 
TER 


NUMBERSOFSHEADERSCHARACTERS 


OAT A DEFINITION 


ST A TUSSBYTE 
CHARACTER$COUNT 
BUFFERS POINTER 


OBJECTSBYTESCOUNT 
INTERRUPT$HANDLER 


lOOOH.OFFSET 
ADDRESS 
FOR BUFFER 


BUFFERSADDRESS 
BUFFER$ADOAESS 
+ 8 


1 
1 


1 


1 
AL,CHARACTERSPORT 
,READ 
CHARACTER 


STATUSSBYTE.STARTSOFSMESSAGESBIT 
.HAS 
A MESSAGE 


HEADERSORSDAT 
A 
.BEEN STARTED' 


AL.ST 
ARTSOFSMESSAGESCHARACTER 
:START 
OF MESSAGE 
PERFORM$A$RET 
.CHARACTER) 


ST ATUSSBYTE.ST 
ARTSOFSMESSAGESBIT 
OR 


HEADEA$DAT A$SIT 
,INITIALIZE 


CHARAC 
TER$COUNT. 
NUMBERSOF$HEADER$CHAAACTERS 


BUFFER$PQINTER,BUFFER$ADDRESS 
,MOVE 
IMMEDIATE 


.OAT A TO BUFFER 
,POINTER 


,SAVE 
CHARACTER 
Dl,BUFFERSPOINTER 
IDII.AL 


01 
BUFFER$POINTER.DI 
CHARACTER$CQUNT 
PERFQRM$A$RET 
ST ATUSSBYTE,HEAOERSOAT 
A$SIT 
OAT A$PROCESSING 


CX.DOO' 
51.S T ART$OF$HEADER$POINTER 
DtSI 


CONVERT 
$TWOSASCII$TO$H::X 
TRANSLATIQN$ERRQR 
IDII.AL 


01 
CX 
HEADER$TRANSlATE$LOQP 


51.S T ART$OF$HEADER$POINTER 
AX.AX 
CX.0003 


AL.ISII 
Al.l 
SI 


CX 
HEADER$CHECKSUM$LOOP 


.DECREMENT 
AND 
TEST 


.FoR 
CHECKSUM 
DONE 


.COMPARE 
CALCULATED 
CHECKSUM 


.WITH 
RECEIVED 
CHECKSUM 


XOR 
STATUS$BYTE.HEADER$DATA$BIT 
.HEADER 
GOOD. 
SWITCH 
TO 


DATA 
PROCESSING 


MOV 
AX.ISI-2J 
.LOAD 
" 
OF OBJECT 
MOV 
OBJECT$BYTECOUNT.AX 
.CODE 
BYTES 
FROM 
HEADER 


SAL 
AX.l 
.GET 
NUMBER 
OF ASCII 
CHARACTERS 


ADO 
AX.02 
.ADD 
2 
RET 
CHARACTER$COUNT.AX 
.SAVE 
FOR DATA 
PROCESSING 


MOV 
ST ATUS$BYTE.HEADER$CHECKSUM$ERROR$BIT 


;TURN 
ON ERROR 
BIT 


MOV 
ST ATUS$BYTE. 
TRANSlATION$EARQR$BIT 
. TURN 
ON 
ERROR 
BIT 


RET 
MOV 
CX,OBJECTSBYTESCOUNT 
,SET 
UP FOR CONVERSION 


MOV 
SI,ST ARTSOFSOAT 
ASPOINTER 
.FROM 
ASCII 
TO 
HEX 


MOV 
DI,ST ARTSOFSDAT 
ASPOINTER-4 


CALL 
CONVERTSTWOSASCIISTOSHEX 


JZ 
TRANSLATIONSERROR 


MOV 
IDIJ.AL 


INC 
01 


DEC 
CX 
;OECREMENT 
AND 
TEST 
FOR 


JNZ 
OAT ASTRANSLA 
TESLOOP 
,DONE 


MOV 
SI,ST ARTSOFSDAT 
ASPOINTER 
-4 
,SET 
UP FOR 
CHECKSUM 


XOR 
AX.AX 
:CALCULATION 


MOV 
CX,OBJECTSBYTESCOUNT 


XOR 
AL,ISII 


ROL 
AL.l 
;CALCUlATE 
CHECKSUM 


INC 
SI 


DEC 
CX 
JNZ 
OAT ASCHECKSUMSLOOP 
CMP 
AL,ISII 
,COMPARE 
CALCULATED 
CHECKSUM 


JNZ 
OAT ASCHECKSUMSERROR 
,WITH 
RECEIVED 
CHECKSUM 


XOR 
ST ATUSSBYTE,ST 
ARTSOFSMESSAGESBIT 
.TURN 
ON 
OR MESSAGESSCOMPLETEDSBIT 
,MESSAGE 
COMPLETED 
BIT 


RET 
,TURN 
OFF START 
OF MESSAGE 
BIT 


MOV 
ST ATUSSBYTE,DA 
T ASCHECKSUMSERRORSBIT 
;TURN 
ON ERROR 
BYTE 


RET 


Several assumptions 
have been made by the logic of the program illustrated 
in 
Figure 4-28; they include: 


1. 
The state of the machine 
has been saved and on return 
will be correctly 
restored. 


2. 
The segment 
registers are set to the correct values. 


3. 
Any hardware errors flags set by the I/O port are handled elsewhere. 


4. 
A subroutine 
named CONVERT$TWO$ASCII$TO$HEX 
that converts 
two 
ASCII characters 
pointed to by the DI register into one hex byte and returns 
the value in AL. 


It is reasonable to expect that the first three assumptions 
are provided for by some 
sort of system 
interrupt 
handler. 
If this is not the case, these assumptions 
can be 
handled by: 


1. 
Using the code shown in Figures 
4-14 and 4-15 to save and restore 
the 
machine state. Note, however, that this routine does not use the BX, DX, or 
BP registers. It is not, therefore, 
necessary to save and restore these registers. 


The RETURN instruction 
should be altered to an IRET instruction. 


2. 
Perform an appropriate 
segment register initialization 
routine. 


3. 
Read the status port for the input device and set a flag in the status byte to 
indicate any errors. Bits 6 or 7 could be used for this purpose. 


The CONVERT$TWO$ASCII$TO$HEX 
routine 
will be presented 
later in this 
chapter. 


8086 string primitive instructions 
are shown in Table 4-8. Each string primitive 
instruction 
performs a sequence of operations normally handled by an instruction 
loop. 


The string primitive instruction 
performs an operation specified by the primitive, then 


increments 
or decrements 
the pointer registers involved in the operation. On each itera- 


tion the affected pointer registers can be incremented 
or decremented, 
by 1 or 2. Pointer 
registers will be incremented 
if the value of the Direction flag in the Flags register is 0; 
the affected pointers 
will be decremented 
if the value of the Direction 
flag is 1. The 
affected pointer registers will be incremented 
or decremented 
by 1 if the low-order bit of 
the string primitive operation code is O. If the low-order bit of the string primitive opera- 
tion code is 1, the affected pointer registers will be incremented 
or decremented 
by 2. 


There are these five string primitives; 


MOYS - Move 8 or 16 bits of data from memory location to memory location 
LODS - Load 8 or 16 bits of data from memory into the AL or AX register 
STOS - Store the AL (8-bit operation) 
or AX 06-bit 
operation) 
register into 
memory 
SCAS - Compare the AL (8-bit operation) 
or AX 06-bit 
operation) 
register with 


memory 


CMPS - Compare memory location with memory location 


String primitive instructions 
used fixed addressing 
modes, as follows: 


MOYS - Move data from the memory location addressed by the SI register in the 


Data Segment to the memory location addressed by the or register in the 
Extra Segment. 


LODS - Load data from the memory location addressed 
by the SI register in the 


Data Segment to the AL or AX register. 


STOS - Store the AL or AX register into the memory location addressed by the 01 


register in the Extra Segment. 


SCAS - Compare 
the AL or AX register contents 
with the data in the memory 


location addressed 
by the or register in the Extra Segment. 


CMPS - Compare the data in the memory location addressed by the SI register in 
the Data Segment with the data in the memory location addressed by the 
or register in the Extra Segment. 


Segment override prefixes allow SI to access a segment other than the Data Seg- 
ment. Segment override prefixes may not be used with the or register. The DI register 
must access the Extra Segment. 


Status 
Mnemonic 
Operands 
Object 
Code 
Bytes 
Clocks 
Operation 
Performed 
0 
0 
I 
T 
S 
Z 
A 
P 
C 


LODS 
1010110w 
1 
12 
(ac! -([SI]I. 
[SI] -(SI] 
±DEL TA 
9 + 13' 
Move data into the AL (8-bit 
operation 
or AX 
(16-bit 
operation) register from the memory location addressed 
by SI. Increment or decrement SI depending on the value 
of the Direction Flag. DELTA is 1 if w = 0, 2 if w = 1 


MOVS 
1010010w 
1 
18 
([Dill -([Sill, 
[SI] -(SI] 
±DELTA 
9 + 17" 
[DI] -(DI] 
±DELTA 
Move 8 or 16 bits of data from 
the memory 
location 


addressed by SI to the memory location addressed by DI. 
Increment or decrement SI and DI depending on the value 
of the Direction Flag. DELTA is 1 if w = 0, 2 if w = 1. 


STOS 
1010101w 
1 
11 
([Dill -(acl, 
[DI] -(DI! 
±DEL TA 
9 + 10' 
Move the contents of the AL (8-bit operation) or AX (16- 
bit operation) register. Increment or decrement DI depend- 
ing on the value of the Direction Flag. DELTA is 1 if w = 0, 
2 if w = 1. 


CMPS 
1010011w 
1 
22 
X 
X 
X 
X 
X 
X 
([Sill - 
([DI]I. [SI] -(SI] 
± DELTA 
9 + 22· 
[DI] -[DI] 
±DELTA 
Subtract 
the 8 or 16 bits addressed by the DI register 
from the 8 or 16 bits addressed by the SI register. Use the 
result to set the flags, then discard the result. Increment or 
decrement SI and DI depending on the value of the Direc- 
tion Flag. DELTA is 1 if w = 0, 2 if w = 1. 


SCAS 
1010111w 
1 
15 
X 
X 
X 
X 
X 
X 
lac! - 
([Dill 
9 + 15· 
Subtract 
the 8 or 16 bits addressed by the DI register 
from the AL (8-bit operation) or the AX 116-bit operation) 
register. Use the result to set the flags, then discard the 
result. Increment or decrement DI depending on the value 
of the Direction Flag. DELTA is 1 if w = 0, 2 if w = 1. 


REP is a one-byte prefix that converts any string primitive instruction into a reex- 
ecuting loop. 


String primitive instructions 
each execute as one iteration of a loop. The source 
and destination 
pointer registers SI and DI are assumed, by string primitive instructions, 
to supply source and/or 
destination 
memory addresses; 
these addresses 
are automat- 
ically incremented 
or decremented 
following each execution 
of the string primitive 
instruction. 
This leaves the address(es) 
pointing to the next location in the string to be 
accessed. The REP prefix specifies a termination 
condition which causes the string pri- 
mitive instruction 
to continue executing until the termination 
condition is met. 


For the MOYS, LODS, and STOS string primitives, there is a single termination 
condition. 
The CX register 
is treated 
as a counter; 
each time the string primitive 
executes, 
the CX register contents 
is automatically 
decremented 
by REP prefix logic. 


When the CX register contents decrements 
to 0, the instruction 
following the string pri- 
mitive is executed. 


CMPS and SCAS also use the CX register as a counter in the presence of a REP 
prefix; and as described for MOYS, LODS, and STOS, the CX register contents decre- 
menting to 0 becomes a termination 
condition. In addition, CMPS and SCAS set status 
flags following each iterative execution. The level of the zero status bit serves as an addi- 
tional termination 
condition. For this to be possible the CMPS and SCAS string primi- 
tives use two forms of the REP prefix: 


1. 
REPZ or REPE which causes a termination 
if the zero status is set after any 
iterated execution 
of the string primitive. 


2. 
REPNZ or REPNE which causes a termination 
condition if the zero status is 
reset after any iterated execution. 


In summary, 
the REP prefix surrounds 
a string primitive instruction's 
execution 
with the following steps: 


1. 
Tests the CX register contents. 
If CX contains 0 proceed to the instruction 
that follows the string primitive. 


2. 
Service any pending interrupt. 


3. 
Execute the string primitive instruction 
once. The pointer register addresses 
are incremented 
or decremented 
during this step as a normal part of the string 
primitive instruction's 
execution. 


4. 
Decrement 
the CX register contents. 


Sa. 
For MOYS, LODS, or STOS proceed to step 1. 


Sb. 
For CMPS or SCAS, compare the zero status with the conditions 
specified 
by the REP prefix. If the specified zero status does not exist, then return to 
step 1; otherwise execute the instruction 
following the string primitve. 


String primitive instructions 
are very powerful. Sequences such as: 


MOV 
AL,[SI] 
INC 
SI 


MOV 
AX,[SI] 
INC 
SI 
INC 
SI 


Consider Figure 4-2. If the Direction Flag is set to 0, this sequence of instructions 


may be replaced by 


In addition, 
note that the REP and MOVW instructions 
are single byte instructions, 


therefore, 
a CALL to this routine would be more expensive 
than inserting 


directly into the program. 
Figure 4-9, the buffer initialization 
routine, 
can be replaced by 


This replacement 
assumes that the Direction Flag is set to O. 


As an exercise, the reader should look over the other programs given earlier on 
this chapter and look for examples that can use the string primitives, 
Consider a variation of the program illustrated 
in Figure 4-1 that compares two 
strings of bytes. Written out completely, the program would appear as follows: 


MOV 
CMP 
JZ 
INC 
INC 
DEC 
JNZ 


AL,[SJ] 
[DIl.AL 
EQUAL 
SI 
DI 
CX 
COMP$BYTES 


;LOAD BYTE FROM SOURCE 
;COMPARE 
WITH DESTINATION 
;TEST FOR SIMILAR BYTES 
;ADJUST 
POINTERS 


:DECREMENT NUMBER TO MOVE 
;LOOP IF NOT DONE 


CMP 
compares 
bytes 
or 
words, 
register-to-register, 
register-to-memory 
or 
memory-to-register. 
In Figure 
4-30 the contents 
of two memory 
buffers 
are compared, 
looking 
for identical 
bytes. 
Pointer 
registers 
SI and 01 address 
the source 
and destina- 
tion registers, 
respectively. 
Registers 
SI, 01, and CX have 
been 
selected 
deliberately 
in 
Figure 
4-30, because 
these 
are the registers 
used by the CMPSB 
and CMPSW 
string 
pri- 
mitives; 
this allows the entire 
program 
sequence 
illustrated 
in Figure 
4-30 to be replaced 
by: 


This 
group 
of 8086 instructions 
unconditionally 
alter 
program 
counter 
contents, 
and in some 
cases they alter Code 
Segment 
register 
contents 
as well. These 
instructions 
are summarized 
in Table 
4-9. 
CALL 
instructions 
are used 
to transfer 
control 
from 
a program 
to a subprogram. 


The subprogram 
may reside 
in the current 
code segment 
or in a code segment 
specified 
by the instruction. 
The address 
of the subprogram 
may be provided 
by the instruction 
as 
an immediate 
address, 
or it may be stored 
in memory 
or registers. 
The four 8086 CALL 
instructions 
allow the following 
possibilities: 


Current Code Segment 
Code Segment Specified 


by Instruction 


Immediate Address 
CALL disp16 
CALL addr 


Address in Memory/Register 
CALL mem/reg 
CALL mem 


The CALL 
dispI6 
instruction 
is the only instruction 
in which a signed 
I6-bit 
num- 
ber is added 
to the 
program 
counter. 
The 
other 
three 
CALL 
instructions 
move 
data 
directly 
from 
memory 
or register 
to the program 
counter. 
Prior 
Code 
Segment 
and/or 
program 
counter 
contents 
are pushed 
onto 
the Stack 
- 
and are thus 
saved. 


RETURN instructions 
are used to transfer control from a subprogram 
back to the 
program 
that 
is cal1ed the 
subprogram. 
RETURN 
instructions 
that 
terminate 
a 
subprogram 
operate as the inverse of the CALL that invoked the subprogram. 
When a 
RETURN 
instruction 
is executed, 
data is popped from the stack into the program 
counter and, optional1y, the Code Segment register. In addition, the RETURN instruc- 
tions can optional1y add a displacement 
to the stack pointer. This al10ws a RETURN 
instruction 
to adjust the stack pointer so it bypasses parameters that are on the stack for 
the subprogram 
to operate 
on. The four 8086 RETURN 
instructions 
al10w these 
options: 


Pop into PC 
Pop into CS,PC 


Normal Return 
RET 
RET 


Add Displacement to 
RET disp16 
RET disp16 


Stack 


Note that the 8086 does not provide Cal1-on-Condition 
or Return-on-Condition 
instructions. 
To implement the corresponding 
8080 instruction, 
the CALL or RETURN 
instruction 
must be combined with a Jump-on-Condition 
instruction. 
For example, the 
fol1owing 8080 instruction 
sequence 


JZ 
CALL 
OR 


NEXT$INSTRUCTION 
;JUMP TO NEXT$INSTRUCTION if ZF = 1 


SUB$PROGRAM 
;JUMP TO SUB$PROGRAM if ZF = 0 


AX.BX 


8086 Jump instructions 
are presented 
in Table 4-9. 8086 Jump instructions 
general1y 
offer the same variations as 8086 CALL instructions. 
An additional Jump instruction 
is: 


which has two bytes of object code, as opposed to the three-byte JM P disp 16 instruction. 
JMP disp is a relative jump; it adds an 8-bit signed binary displacement 
to the program 
counter. This al10ws program jumps to occur within 1-127 bytes of the Jump instruction. 
Numerous 
programming 
examples 
given in this chapter 
use Jump instructions 
and 
il1ustrate their use. 


Status 
Mnemonic 
Operands 
Object 
Code 
Bytes 
Clocks 
Operation 
Performed 
0 
0 
I 
T 
S 
Z 
A 
P 
C 


CALL 
addr 
9A 
5 
28 
[SP) - 
[SP) - 
2, liSP)) - 
[PCl, 


kk 
[SP) - 
[SP) - 
2, liSP)) - 
[CS!. 


jj 
[PC] - 
addr (offset portion), 
[CS] - 
addr (segment portion) 


hh 
Call a subroutine 
in another 
code segment 
space. A new 
gg 
offset 
address, 
jjkk, and a new segment 
address, 
gghh, 


are provided. 


CALL 
disp16 
E8 
3 
19 
[SP) - 
[SP] -2, 
liSP)) - 
[PC) 
kk 
[PC] - 
[PC] + disp16 
jj 
Call a subroutine 
in the current 
code segment. 


CALL 
mem 
FF 
2,3 or 
37 + EA 
[SP) - 
[SP) - 
2, liSP)) - 
[PCl, 
(SEG 
mod 011 
rim 
4 
[SP) - 
[SP) - 
2, liSP)) - 
[CS!. 


+ PC) 
(DISP) 
[PCl - 
[mem!. 
[CS] - 
[mem + 2) 
(DISP) 
Call a subroutine 
in another code segment 
space. The 16 
bits contained 
in the memory 
location addressed 
by mem 
are moved 
into the PC. The contents 
of the succeeding 
memory 
word 
are loaded into the CS register. 


CALL 
memlreg 
FF 
2,3 or 
21 + EA 
[SP) - 
[SP] -2, 
liSP)) - 
[PC) 
(PC only) 
mod 010 
rim 
4 
(mem) 
[PC) - 
[mem/reg) 
(DISP) 
16 (reg) 
Call a subroutine 
in the current code segment. The 16 bits 


(DISP) 
contained 
in the memory location or register addressed by 
memlreg 
are moved into the PC. 


RET 
C3 
1 
8 
[PC) - 
liSP)), 
[SP) - 
[SP] + 2 
Perform 
a return to a calling program 
in the current 
code 


segment. 


RET 
CB 
1 
18 
[PC) - 
liSP)), 
[SP) - 
[SP] + 2. 


[CSl - 
[[SP)!. [SPl - 
[SP) + 2 
Perform a return to a calling program 
in another code seg- 


ment. 


RET 
disp16 
C2 
3 
12 
[PC) - 
liSP)!. 
[SP] - 
[SP) + 2 + disp16 


kk 
Perform 
a return to a calling program 
in the current code 


jj 
segment; 
adjust the stack pointer 
by disp 16. 


RET 
disp16 
CA 
3 
17 
[PC] - 
[[SP)!. [SP] - 
[SP) + 2, 


kk 
[CS) - 
liSP)), 
[SP) - 
[SP] + 2 + disp16 
jj 
Perform a return to a calling program 
in another code seg- 
ment; 
adjust the stack pointer by disp 16. 


Status 
Mnemonic 
Operands 
Object 
Code 
Bytes 
Clocks 
Operation 
Performed 


0 
D 
J 
T 
S 
Z 
A 
P 
C 


JMP 
addr 
EA 
5 
15 
[PC)- 
addr (offset portion). (CS] - 
addr Isegment portion I 
kk 
Jump to another code segment. A new offset 
address, 


jj 
jjkk, and a new segment address, gghh, are provided. 


hh 
gg 


JMP 
disp 
EB 
2 
15 
[PCI - 
[PC] + disp 


kk 
Perform a program relative jump. 


JMP 
disp16 
E9 
3 
15 
[PC) - 
[PC) + disp 16 


kk 
Jump to a location in the current code segment. 


jj 


JMP 
mem 
FF 
2,3 or 
24 + EA 
[PC] - 
[mem], ICSl - 
[mem + 2] 


(SEG 
mod 101 rim 
4 
Jump to a location in another code segment. Move the 


+ PC! 
(DISPI 
contents of the memory location addressed by mem into 


(DISP) 
the PC. Move the contents 
of the succeeding 
memory 


location into the CS register. 


JMP 
mem/reg 
FF 
2,3 or 
18 + EA 
[PC) - 
Imem/reg] 
(PC only! 
mod 100 rim 
4 
(mem) 
Jump to a memory location in the current code segment. 


(DISP! 
11 (reg) 
Move the contents 
of the memory 
location 
or register 
(DISP) 
addressed by mem/reg into the PC. 


8086 instructions 
that alter the contents of the program counter based on various 
conditions 
are presented 
in Table 4-10. 
Table 4-11 lists the arithmetic 
comparisons 
that are commonly 
used and shows 
how to derive them with the 8086. 
In general, 
greater or less are adjectives that are applied to signed operations. 


Above or below are adjectives that are applied to unsigned operations. 
The 8086 instructions 
that decrement 
the CX register, then optionally alter the 
contents 
of the program counter 
are presented 
in Table 4-12. These instructions 
are 
commonly referred to as LOOP structures. 
As an exercise, review the previous sections 
of this Chapter and replace the structure 


Each replacement 
represents a savings of one byte of object code. In addition, one 
clock period per execution is saved. For a loop that iterates 100 times per execution, 
this 
represents 
a savings of 100 clock periods, or 20 p..son a 5 MHz 8086. 
The JCXZ instruction 
is unique in this group of instructions 
in that it does not 
jump based on the-contents 
of the Flags register, rather the JUMP is performed if the 
CX register is O. Since the JCXZ instruction shares an interest in the CX register, along 
with the LOOP instructions, 
it is also presented 
with LOOP instructions 
in Table 4-12. 


The LOOP instruction combines the DEC CX and JNZ instructions. 
For example, 
the instruction 
sequence of Figure 4-1 can be rewritten as: 


MOV 
MOV 


INC 


INC 
LOOP 


AL, 
[511 


[011. AL 


51 


01 
MOVE$BYTE5 


In all future code sequences, 
the LOOP instruction 
will replace the 


oEC 
CX 


JNZ 
label 


Status 
Mnemonic 
Operands 
Object 
Code 
Bytes 
Clocks 
Operation 
Performed 
0 
0 
I 
T 
S 
Z 
A 
P 
C 


JA 
disp 
77 
2 
4/16 
If ([C] OR [Z)! = 0, then [PC) - 
[PC) + disp 
kk 
Branch relative if the Carry and Zero flags are O. 


JNBE 
disp 
same as JA 


JAE 
disp 
73 
2 
4/16 
If IC) = 0, then (PC) - 
[PC) + disp 
kk 
Branch relative if the Carry flag is O. 


JNC 
disp 
same as JAE 
JNB 
disp 
same as JAE 


JB 
disp 
72 
2 
4/16 
If [C] = 1, then [PC] - 
(PC] + disp 
kk 
Branch relative if the Carry flag is 1 


JC 
disp 
same as JB 


JNAE 
disp 
same as JB 


JBE 
disp 
76 
2 
4/16 
If (lC) OR [Z)) = 1, then (PC) - 
[PC) + disp 
kk 
Branch relative if the Carry flag or the Zero flag are equal 
to 1 


JNA 
disp 
same as JBE 


JE 
disp 
74 
2 
4/16 
If [Z) = 1, then (PC] - 
[PC) + disp 
kk 
Branch relative if the Zero flag is 1. 


JZ 
disp 
same as JE 


JG 
disp 
7F 
2 
4/16 
If ([Z] = 0 AND US) = [0))) = 1, then (PC) - 
[PC) + disp 
kk 
Branch relative 
if the Zero flag is 0 and the Sign flag is 
equal to the Overflow 
flag. 


JNLE 
disp 
same as JG 


JGE 
disp 
7D 
2 
4/16 
If (S] = [OJ. then [PC) - 
[PC) + disp 
kk 
Branch relative 
if the Sign flag is equal to the Overflow 
flag. 


JNL 
disp 
same as JGE 


JL 
disp 
7C 
2 
4/16 
If (S] '" 
(OJ. then (PC] - 
(PC] + disp 
kk 
Branch relative if the Sign flag is not equal to the Overflow 
flag. 
JNGE 
disp 
same as JL 


JLE 
disp 
7E 
2 
4/16 
If US] = (0) AND (Z] = 0) = 1, then (PC] - 
[PC) + disp 
kk 
Branch relative 
if the Sign flag is equal to the Overflow 
JNG 
same as JLE 
flag and the Zero flag is O. 


Status 
Mnemonic 
Operands 
Object 
Code 
Bytes 
Clocks 
Operation 
Performed 
0 
0 
I 
T 
S 
Z 
A 
P 
C 


JNE 
disp 
75 
2 
4/16 
If [Z] = 0, then [PC] - 
[PC) + disp 
kk 
Branch relative if the Zero flag is 0. 


JNZ 
disp 
same as JNE 


JNO 
disp 
71 
2 
4/16 
If [0] = 0, then [PC] - 
[PC) + disp 
kk 
Branch relative if the Overflow 
flag is 0. 


JNP 
disp 
7B 
2 
4/16 
If [P] = 0, then (PC] - 
[PC] + disp 


kk 
Branch relative if the Parity flag is 0. 


JPO 
disp 
same as JNP 


JNS 
disp 
79 
2 
4/16 
If [5] = 0, then [PC] - 
[PC) + disp 
kk 
Branch relative if the Sign flag is 0. 


JO 
disp 
70 
2 
4/16 
If (0) = 1, then [PC) - 
[PC) + disp 
kk 
Branch relative if the Overflow 
flag is 1. 


JP 
disp 
7A 
2 
4/16 
If [P) = 1, then (PC) - 
[PC) + disp 
kk 
Branch relative if the Parity flag is 1. 


JPE 
disp 
same as JP 


JS 
disp 
78 
2 
4/16 
If (5] = 1, then [PC] - 
[PC] + disp 
kk 
Branch relative if the Sign flag is 1. 


JCXZ 
disp 
E3 
2 
6/18 
If [CX) = 0, then [PC] - 
[PC) + disp 
kk 
Branch relative if the CX register 
is 0. 


Signed 
Unsigned 


= 
.EO. 
JE or JZ 
Equal or zero 
JE or JZ 
Equal or zero 


"" 
.NE. 
JNE or JNZ 
Not equal or not zero 
JNE or JNZ 
Not equal or not zero 


> 
.GT. 
JG or JNLE 
(Greater) or not (less or equall 
JA or JNBE 
(Above) or not (below or equall 


2: 
.GE. 
JGE or JNL 
(Greater or equall or (not less) 
JAE or JNB 
(Above or equall or (not below) 


< 
• LT. 
JL or JNGE 
(Less) or not (greater or equall 
JB or JNAE 
(Below) or not (above or equall 


~ 
.LE. 
JLE or JNG 
(Less or equall 
or (not greater! 
JBE or JNA 
(Below or equall or (not above) 


Status 
Mnemonic 
Operands 
Object 
Code 
Bytes 
Clocks 
Operation 
Performed 
0 
0 
I 
T 
S 
Z 
A 
P 
C 


LOOP 
disp 
E2 
2 
5/17 
[CX] -[CX] 
-1, 
if [CX] = 0, then [PC] -[PC] 
+ disp 
kk 
Decrement 
the CX register, 
not affecting 
the flags. If the 
CX register 
is 0, branch relative. 


LOOPE 
disp 
E1 
2 
6/18 
[CX] - 
[CX) - 
1, if [CX] 
';z!c 0, and [Z] = 1. then [PC) - 
[PC] 
kk 
+ disp 
Decrement 
the CX register, 
not affecting 
the flags. If the 
CX register is not 0 and the Zero flag is 1, branch relative. 


LOOPZ 
disp 
same as LOOPE 


LOOPNE 
disp 
EO 
2 
5/19 
[CX) - 
[CX) - 
1, if [CX] 
';z!c 0 and [ZI = O. then [PC) - 
[PC) 
kk 
+ disp 
Decrement 
the CX register, 
not affecting 
the flags. If the 
CX register is not 0 and the Zero flag is O. branch relative. 


LOOPNZ 
disp 
same as LOOPNE 


JCXZ 
disp 
E3 
2 
6/18 
If [CX) = 0, then [PC) -[PC] 
+ disp 
kk 
Branch relative 
if the CX register 
is O. 


The 8086 instructions 
which operate on the Flags register and control various 
aspects of the 8086's external interface are shown in Table 4-13. 


Memory mapped addressing on the 8086 has one significant advantage over I/O 
port addressing. 
The string primitive instructions 
allow repeated operations to be per- 
formed on memory 
addresses, 
depending 
on how the memory 
mapped I/O address 
decoding is performed. 
Compare Figure 4-32, where I/O port addressing is used, with 
Figure 4-33, where memory mapped addressing is used. Both of these routines output a 
block of data. The number 
of bytes in the block is contained 
in the CX register. The 
routine in Figure 4-32 moves the block pointed to by the D1 register out to I/O$PORT. 
The routine in Figure 4-33 moves the block pointed to by the SI register to the memory 
mapped I/O port addressed 
by the DI register. 


I/O port addresses may be specified directly, or the I/O port address may be held 
in the DX register. 8-bit addresses are specified directly; 
16-bit I/O port addresses are 
specified via the DX register. 


LODSB 
OUT 
IO$PORT,AL 
LOOP 
OUTPUT$A$BYTE 
RET 


Note, a block of memory addresses must be assigned to the memory mapped I/O 
ort since MOVS automatically 
increments/decrements 
the addresses in SI and DI. 


Status 
Mnemonic 
Operands 
Object 
Code 
Bytes 
Clocks 
Operation 
Performed 
0 
D 
I 
T 
S 
Z 
A 
P 
C 


CLC 
F8 
1 
2 
0 
IC] ~O 


Clear Carry status. 


CMC 
F5 
1 
2 
X 
IC] ~ 
[C) 


Complement 
the Carry status. 


CLD 
FC 
1 
2 
0 
ID] ~O 
Clear the Direction 
flag. 


CLI 
FA 
1 
2 
0 
[I] -0 
Clear the 
Interrupt 
enable 
status, 
thereby 
disabling 
all 
interrupts. 


STC 
F9 
1 
2 
1 
[C] ~1 
Set the Carry status. 


STD 
FD 
1 
2 
1 
[D] -1 
Set the Direction 
flag. 


STI 
FB 
1 
2 
1 
[1]-1 
Set the Interrupt 
flag to 
1, thereby 
enabling 
interrupts. 


NOP 
90 
1 
3 
Perform 
no operation. 


ESC 
mem 
11011xxx 
2,3 
or 
8 + EA 
Place the contents 
of the memory 
location addressed 
by 
mod xxx rim 
4 
mem onto the address/data 
bus. If mod = 11, perform 
no 
(DISPI 
operation. 


(DISP) 


LOCK 
FO 
1 
2 
Guarantee this 8086 
control 
of its bus during the execu- 
tion of the next instruction. 


WAIT 
9B 
1 
3 or 
Enter a WAIT state until external logic drives the TEST pin 
more 
low. 


HLT 
F4 
1 
2 or 
Enter a HALT state. 


more 


Status 
Mnemonic 
Operands 
Object 
Code 
Bytes 
Clocks 
Operation 
Performed 
0 
0 
I 
T 
S 
Z 
A 
P 
C 


IN 
ac.OX 
1110110w 
1 
8 
[ac! - 
[PORTDX] 


Input to the AL register (8-bit operationl or the AX register 
(16-bit 
operation) from the I/O port specified by the OX 
register. 


IN 
ac.port 
1110010w 
2 
10 
lac) - 
[port] 
kk 
Input to the AL register 18-bit operationl or the AX register 
116-bit 
operation) 
from 
the 
I/O port 
specified 
in the 
second byte of the instruction. 


OUT 
ac.OX 
1110011w 
1 
8 
[PORTDX] - 
[ac) 
Output the contents of the AL register 18-bit operation) or 
the AX register 116-bit operation) to the I/O port specified 
by the OX register. 


OUT 
ac.port 
1110111w 
2 
10- 
[port] - 
lac] 


kk 
Output the contents of the AL register 18-bit operation) or 
the AX register 116-bit operation) to the I/O port specified 
in the second byte of the instruction. 


The software interrupt 
instruction, 
the interrupt 
on overflow instruction 
and the 


return from interrupt 
instruction 
are shown in Table 4-15. 


The software interrupt 
instruction 
is used for two major purposes: 


1. 
To debug programs. 
The single byte software 
interrupt 
instruction 
calls a 


routine whose address is at location 000C16• Typically this routine is part of a 
debug package and is used to handle breakpoints. 


2. 
To call subroutines 
whose 
address 
is present 
in the first 1024 bytes of 


memory. When the two byte software interrupt instruction is used, anyone 
of 


256 subroutines 
whose address has been placed in the first 1024 bytes of 


memory may be called. 


Software interrupt 
instructions 
have the advantage 
of using one or two bytes of 


program memory, 
as compared to five bytes of program memory used by an interseg- 
ment CALL. In addition, the software interrupt 
automatically 
saves the Flags register 


onto the stack - 
a desirable feature in many cases. A minor disadvantage 
of software 


interrupts 
is that if a routine is called via a software interrupt, 
the routine must return 


via an IRET instruction, 
which takes more time than a RET instruction. 


Status 
Mnemonic 
Operands 
Object 
Code 
Bytes 
Clocks 
Operation 
Performed 
0 
D 
I 
T 
S 
Z 
A 
P 
C 


INT 
1100110v 
0 
0 
[SP] - 
[SP) - 2, liSP)) -[FLAGS). 
(I! - 
0, [T] - 
0, 


kk(if V = 1) 
(SP) - 
(SP] - 
2, liSP)) - 
[CS). [SP) - 
(SP) - 
2, 


liSP)) - 
[PC). [CS) - 
(vector Isegment part)). 


(PCJ - 
(vector loffset 
part)J 


V=O 
1 
52 
If v = 0, vector loffset part) = [OOOOC16J 
vector Isegment part) = (OOOOE16) 


V = 1 
2 
51 
If v = 1, vector (offset part) = [(kk • 41) 
vector (segment part) = [Ikk • 4) + 2) 


Perform a software 
interrupt. 


INTO 
CE 
1 
53 
0 
0 
If (0) = 1, then [SP) - 
[SPJ - 2, liSP)) - 
[FLAGS). IF - 
0, 


TF - 
0, (SP] - 
[SP) - 
2, [[SP)) - 
(CS). [SP] - 
(SP) - 2, 


(0 = 1) 
([SP))- 
[PCl. (CSJ- 
(00012161. [PC)- 
(00010161. 


4 
If the Overflow flag is set. perform a software interrupt via 
the vector dedicated to overflow 
processing, 
otherwise, 


(0 = 0) 
execute the next sequential instruction. 


IRET 
CF 
1 
24 
X 
X 
X 
X 
X 
X 
X 
X 
X 
[PC) - 
liSP]). [SP) - 
(SP] + 2, [CSJ - 
[(SP)l. [SP) - 
(SP) 
+ 2, (FLAGS) - 
IISP)l. [SP) - 
[SP) + 2 
Return from an interrupt service routine. 


8086 instructions 
that perform rotates and shifts are shown in Table 4-16. 
Rotate and shift instructions 
are frequently used to perform bit testing operations. 


These instructions 
are used by themselves 
or in conjunction 
with logical operations 
to 
test for various bit patterns. To test the low-order bit of a register, a 


ROR 
reg, 1 


instruction 
operates one cycle faster than a 


instruction. 
The ROR instruction 
tests the Carry Status, as opposed to the AND or 
TEST instructions 
where the Zero Status is significant. To test the low-order bit ofa 16- 
bit pointer/index 
register, use: 


TEST 
reg,OOOlH 


instruction. 
Note, however, that the ROR instruction 
alters the contents of the register, 
whereas the TEST instruction 
is non-destructive. 


The high-order 
bit of an 8-bit register or a 16-bit register can be tested, 
as de- 
scribed above, by replacing the ROR instructions 
with ROL instructions. 


Rotate and shift instructions 
perform arithmetic 
operations. 
The arithmetic 
shift 
operations 
can be used to perform 
both multiplication 
and division. The routine 
in 
Figure 4-34 converts 
two ASCII characters 
into their hex equivalent. 
This routine 
assumes 
that the SI register points to the two characters 
(high-order 
byte first) and 
returns the result in the AL register. This routine also ensures that the bytes converted 
lie in the range 0 ~ character ~ 9 or A ~ character ~ F. If the character is out of range, 
the zero status will be 1 on return, 
otherwise the zero status will be O. 


PUSH 
LODSB 
CALL 
JZ 


MOV 
SAL 
MOV 
LODSB 
CALL 
JZ 


OR 
OR 
POP 
RET 


RET 


ENDP 


PROC 


SUB 
JL 
CMP 
JL 
SUB 
CMP 
JGE 
RET 


XOR 
RET 


CONVERT$ASCII$TO$HEX 
ENDP 


TRANSLA TION$ERROR: 


CONVERT$TWO$AS 
CII$TO$HEX 


CONVERT$ASCII$TO$HEX 


DONE: 


TRANNY$ERROR: 


CONVERT$ASCII$TO$HEX 
TRANSLA TION$ERROR 


CL,4 
AL,CL 
AH,AL 


CONVERT$ASCII$TO$HEX 
TRANSLA TION$ERROR 


AL,AH 
AH,OFFH 
CX 


NEAR 


AL,30H 
TRANNY$ERROR 
AL,OAH 
DONE 
AL,07H 
AL,10H 
TRANNY$ERROR 


;SET UP FOR ROTATE 
;ROTATE 
FOUR TIMES 


;SAVE IN AH 


;CREATE THE HEX BYTE 
;TURN ZF=O 


Figure4-34, 
Routine 
to Convert 
Two 
ASCII 
Digits 
to 
Their 
Hex 
Equivalents 


Status 


Mnemonic 
Operands 
Object Code 
Bytes 
Clocks 
Operation 
Performed 
0 
0 
I 
T 
S 
Z 
A 
P 
C 


RCL 
mem/reg, 
110100vw 
2,3 
or 
count = 1, 
X 
X 
Rotate the contents 
of the memory location or register 
count 
mod 010 rim 
4 
reg: 2 
specified by mem/reg 
left through the Carry status. The 
(DISP) 
mem: 
number of bits to rotate is determined by count and will be 


(DISP) 
15 + EA 
either 1 Iv = 0) or the contents of the CL register (v = 1). 


count 
The rotation is performed 
as follows: 


= [CL) 
reg: 
if w = 0 


8+4"[CL) 
7 
6 
5 
4 
3 
2 
1 
0 


mem: 
20 + EA + 
4·[C!.J 


if w = 1 


15 
14 
13 
2 
1 
0 


qHIII==IIJJl 


Status 
Mnemonic 
Operands 
Object 
Code 
Bytes 
Clocks 
Operation 
Performed 
0 
0 
I 
T 
S 
Z 
A 
P 
C 


RCR 
memlreg, 
110100vw 
2,3 
or 
count = 1, 
X 
X 
Rotate the contents 
of the memory location or register 


count 
mod 011 
rim 
4 
reg: 2 
specified by memlreg 
right through the Carry status. The 


(DISP) 
mem: 
number of bits to rotate is determined by count and will be 


(DISP 
15 + EA 
either 1 (v = 0) or the contents of the CL register (v = 1). 


count 
The rotation is performed 
as follows: 


=[CL] 
ifw 
= 0 
reg: 
8 + 4"(CL] 
7 
6 
5 
4 
3 
2 
1 
0 


mem: 
20 + EA 
+ 4" 
(CL] 


ifw 
= 1 


15 
14 
13 
2 
1 
0 


1f=ITIT==III]J 


Status 
Mnemonic 
Operands 
Object Code 
Bytes 
Clocks 
Operation 
Performed 
0 
0 
I 
T 
S 
Z 
A 
P 
C 


ROL 
memlreg, 
110100vw 
2,3 
or 
count = I, 
X 
X 
Rotate the contents 
of the memory location or register 


count 
mod 000 rim 
4 
reg: 2 
specified by memlreg left. Rotate the high-order bit of the 
(DISP! 
mem: 
operand into the Carry status. The number of bits to rotate 
IDISP) 
15 + EA 
is determined by count and will be either 1 Iv = 0) or the 
count 
contents of the CL register (v = 11. The rotation is per- 
= [CLI 
formed as follows: 


reg: 


w=O 
8 + 4·[CLI 
mem: 
7 
6 
5 
4 
3 
2 
1 
a 


20 + EA 


~ 


+ 4· 
[CLl 


w = 1 


15 
14 
13 
2 
1 
a 
~==IIJ( 


Status 
Mnemonic 
Operands 
Object 
Code 
Bytes 
Clocks 
Operation 
Performed 
0 
0 
I 
T 
S 
Z 
A 
P 
C 


RDR 
memlreg, 
11010Dvw 
2,3 
or 
count = 1, 
X 
X 
Rotate the contents 
of the memory location or register 


count 
mod 001 rim 
4 
reg: 2 
specified by memlreg right. Rotate the low-order bit of the 
IDISP) 
mem: 
operand into the Carry status. The number of bits to rotate 
IDISP) 
15 + EA 
is determined by count and will be either 1 (v = 0) or the 
count 
contents 
of the CL register (v = 1). The rotation is per- 
= [ell 
formed as follows: 


reg: 


if w = 0 
8 + 4'[CLI 
mem: 
7 
6 
5 
4 
3 
2 
1 
0 


20 + EA 


~ 


+ 4' [CLI 


ifw 
= 1 


15 
14 
13 
2 
1 
0 
1J r=o=r::~ 


Status 
Mnemonic 
Operands 
Object Code 
Bytes 
Clocks 
Operation 
Performed 
0 
D 
I 
T 
S 
Z 
A 
P 
C 


SAL 
mem/reg, 
110100vw 
2,3 
or 
count = 1, 
X 
X 
X 
? 
X 
X 
Shift 
the contents 
of the memory 
location 
or register 


count 
mod 100 rIm 
4 
reg: 2 
specified by mem/reg 
left 
Shift a zero into the low-order 
(DISP) 
mem: 
bit of the operand. The number of bits to shift is deter- 
(DISP) 
15 + EA 
mined by count and will be either 1 (v = 0) or the contents 
count 
of the CL register (v = 11. The rotation is performed as 


=(CLl 
follows: 


reg: 
if w = 0 
8 + 4"rCLl 
mem: 
7 
6 
5 
4 
3 
2 
1 
0 


20 + EA 
I 
I 
I 
I 
I 
I 
I I 
+ 4" [CLl 
C 


+0 


ifw 
= 1 


15 
14 
13 
2 
1 
0 


&ITII:=III( 


0 


SHL 
mem/reg, 
same as SAL 
count 


Status 
Mnemonic 
Operands 
Object 
Code 
Bytes 
Clocks 
Operation 
Performed 


0 
0 
I 
T 
5, • Z 
A 
P 
C 


SAR 
mem/reg, 
110100vw 
2,3 
or 
count = 1, 
X 
X 
X 
] 
X 
X 
Shift the contents of the memory location right. Propag- 


count 
mod 111 rIm 
4 
reg: 2 
ate the sign of the operand by preserving the value of the 
(DISP) 
mem: 
high-order 
bit. The number of bits to shift is determined 
(DISP) 
15 + EA 
by count and will be either 1 Iv = 01 or the contents of the 
count 
CL register (v = 1). the rotation is performed as follows: 


=(CL) 
if w = 0 
reg: 
8 + 4"[CL) 


~ 


mem: 


~ 


20 + EA 
+4"(CL) 


if w = 1 crrrr 
210 


~ 
==ITIjJ 


. 


Status 
Mnemonic 
Operands 
Object Code 
Bytes 
Clocks 
Operation 
Performed 
0 
0 
I 
T 
S 
Z 
A 
P 
C 


SHR 
memlreg, 
111100vw 
2,3 
or 
count = 1, 
X 
X 
X 
? 
X 
X 
Shift 
the contents 
of the memory location 
or register 
count 
mod 101 rim 
4 
reg: 2 
specified by memlreg 
right. Shift a zero into the high- 


(DISP) 
mem: 
order bit of the operand. The number of bits to shift is 
(DISP) 
15 + EA 
determined by count and will be either 1 (v = 01 or the 
count 
contents of the CL register (v = 1). The rotation is per- 


= [CLI 
formed as follows: 


reg: 
if w = 0 
8 + 4·[CLI 


7 
6 
5 
4 
3 
2 
1 
a 


mem: 


1] 


0 
20 + EA 
+ 4 • [CLI 


ifw 
= 1 


15 
14 
13 
2 
1 
a 


qJ 0-0:::0::=IDjJ 


Software 
Development 


• 
An Editor 


Editors are used to enter and/or modify source code. The source code is usually 
saved on some form of mass storage, e.g., floppy disk, hard disk, or in dire emergencies, 
paper tape. The source code is usually organized on mass storage in the form of a file, 
referred to as the source file. Editors create and operate on source files in a number 
of 
different ways depending 
on how the source file may be accessed, e.g., a source file 
residing on magnetic tape is not handled in the same way as a source file residing on 
floppy disk. Users typically request editor functions by entering commands from a video 
terminal. 


• 
An Assembler 


Assemblers are used to translate source code into object code. An assembler reads 
the source file that (in most cases) has been generated 
by the editor, translates 
the 
source code, and then writes the object code to mass storage in the form of a file referred 
to as the object file. An assembler 
may also produce additional files such as a file that 
contains the source code and the object code, referred to as the listing file, and a file 
which contains all the labels and variable names used in the source code, referred to as 
the symbol file. The listing file is usually printed, and then referred to during the debug- 
ging process. 


• 
A Debugger 


Debuggers are used to assist in detecting errors in the object code. A debugger is 
either loaded with the object code generated by the assembler, 
or it is loaded by itself, at 
which point the user can request that an object file be loaded from mass storage. A typi- 
cal debugger allows the user to control the execution of the object code, and to view the 
contents 
of memory and registers. 


The three 
programs 
mentioned 
above 
are the 
major 
tools essential 
for the 
development 
process. 
Additional 
tools which are useful include linkers and loaders. 


Linkers are used to link multiple subprograms 
into one program. Linkers resolve exter- 
nal references 
from subprograms. 
External references 
occur when instructions 
in one 
module refer to a symbol (label or variable name) 
that is defined in another 
module. 


Loaders are used to bring object code from mass storage into memory. 
For the purposes of this discussion, consider a hypothetical system, to be used in 
the software development 
process, which contains the following hardware elements: 


• 
CPU 


• 
RAM 


• 
Floppy disk drives 


• 
CRT terminal 


Throughout 
this discussion 
of editors, 
assemblers, 
and debuggers, 
it will be 
assumed that these support programs are executed on the above system. In addition, as 
each of the support programs is discussed, a hypothetical 
command 
language for the 
support program will be used in examples illustrating typical functions provided by the 
support 
program. 
It should be emphasized 
that both the system and the command 
languages are only examples, and will not be found in the real world. 


Most editors accomplish their function by performing 
some combination 
of the 
following tasks: 


• 
Reading data from mass storage into memory. 


• 
Operating on the data in memory in response to commands 
from the user. 


• 
Writing data from memory out to mass storage. 


Figure 5-2 shows an example of this operation. 


Mass 
Storage 


Mass 
Storage 


- 
-- 
- 
- 


- 
- 
- 
- 
- 
- 
- 
E 
- 
b 


Edited source 
code file 


ditor's 
uffer area 


The following terms are commonly used when describing basic types of editors. 
Buffer: data is read from mass storage into a memory 
area referred 
to as the 
buffer. All editing commands 
operate on the data in the buffer. After the data has been 
processed it is written from the buffer to mass storage. 
Character or Line Pointer: 
The editor maintains a pointer into the buffer. All user 
commands are taken as being relative to this pointer. For example, a command to delete 
four characters would delete four characters after the pointer. Some editors use a pointer 
that refers to a particular 
character 
in the buffer. 
These 
editors are referred 
to as 
character-oriented 
editors. Some editors use a pointer that refers to a particular line. 


These are referred to as line-oriented 
editors. 


What sorts of functions should an editor provide? 
The editor should provide the following capabilities: 


• 
Read data from mass storage to memory 


• 
Write from memory to mass storage 


• 
Insert data into memory 


• 
Delete data from memory 


• 
Change the position of the character/line 
pointer in the buffer 


• 
Display the contents 
of the buffer 


• 
Search the buffer for occurrence of the specified string 


Change the contents 
of the buffer 


System commands 


The sample editor used to illustrate these capabilities will respond to user com- 
mands entered at the CRT terminal. 
A command 
consists of three fields: 


Number 
indicates 
the number 
of times 
that a particular 
command 
is to be 
executed. This field is interpreted as a decimal number. This field may be omitted. If this 
field is omitted, 
a default value of I is assumed. 


Command 
is a single character which indicates the operation to be performed. 


Strings is a sequence of characters. Some commands use one or more strings while 
they are executing. 
This field may be omitted. 
Strings are terminated 
by either a # 
character or the return character. 


All commands 
are terminated 
by a carriage return, 
represented 
by 0. 
The 
following are examples of command 
strings: 


A([) 
10L ([) 
CTHE#AN 
([) 


Append one line to the buffer 
Move pointer 10 lines down in buffer 
Change the string THE to AN 


~ead/write Data to/from Memory 


The editor must 
provide the ability to read or write from the buffer to mass 
torage. The user should be able to specify the amount of data that is to be transferred. 
ypical amounts 
of data would include: 


• 
One or more characters 


• 
One or more lines. For example, 
transferring 
one line would move all data 


until a carriage return 
is detected. 
Transferring 
n lines would move all data 


until n carriage returns are detected. 


• 
An entire buffer. For a read operation 
this would involve moving data from 


mass storage into the buffer until the buffer is full. For a write operation, 
this 
would involve moving data from the buffer to mass storage until the entire 
buffer has been moved to mass storage. 


~dditional 
features that might be useful include: 


• 
An operation that transfers data from mass storage and then deletes the data 
that has been transferred. 


• 
A read or write operation that would transfer data until a specific character is 
detected. For example, this would allow the user to transfer pages of data, that 
is to say transfer all information 
until an end-of-page character 
(form feed) is 
detected. 


As an example, consider the case where the sample editor command to add lines 
to the buffer is A. The command 


I 
A0 


would add one line to the buffer. The command 


I 
lM0 


would add ten lines to the buffer. The command 


!A 0 


ould fill the buffer with information 
from mass storage. The execution 
of the com- 


/TIand 


Mass 
Storage 


Pointer to 


- 
mass storage 


- 


Pointer to 
end of buffer 


Mass 
Storage 


Pointer to 
J-----i 
..- 
mass storage 


Pointer to 
t-----t....- 
end of buffer 


The editor must provide the ability to add data to the buffer. Users typically need 
to perform one of two types of source code insertion. These are: 


Large amounts 
of source code must be inserted. This would occur when the 
source code is first being entered or when a significant revision of the source 
code is being performed. 


One or two lines of source code must be entered. This would occur when the 
debugging process is underway and "bugs" 
are corrected or when the source 
code is first being entered and the user discovers that one or two lines were 
inadvertently 
omitted. 


Most editors respond 
to these two different 
needs by supplying two different 
modes of insertion: a mode by which the user may enter unlimited amounts of data, and 
a mode by which a limited amount of data may be entered by the user. 
Consider the case where the sample editor command 
to insert data is I. Suppose 
the buffer appears as follows: 


MOV 
ADD 
Pointer- 
JNC 


CX.AX 
DX.SP 
EXIT$ST AGE$LEFT 


MOV 
CX.AX 
ADD 
DX,SP 


SHR 
OX,! 
JNC 
EXIT$ST AGE$LEFT 


The editor must provide the ability to remove 
data from the buffer. The user 
should be able to specify the amount 
of source code to be removed. 
Typical amounts 
include: 


One or more lines. Removing one line will remove the line pointed to by the 
line pointer 
or remove 
all data in the buffer beginning 
with the character 
pointed to by the current character pointer until a carriage return is detected. 
Removing n lines would remove the n lines below the current line pointer or 
all data from the current character pointer until n carriage returns have been 
detected. 


Consider 
the case where the sample editor command 
to delete a line from the 
buffer is K. Suppose the buffer appears as follows: 


Pointer_ 
MOV 
ADD 
SHR 
JNC 


CX,AX 
DX,SP 
OX,! 
EXIT$ST AGE$LEFT 


MOV 
CX,AX 


JNC 
EXIT$ST AGE$LEFT 


The editor must provide the ability to move the character/line 
pointer to different 


positions in the buffer. The user should be able to specify the number of characters/lines 
that the pointer is to be moved. Additional capabilities that might be useful include: 


• 
Move the character/line 
pointer to the top of the buffer. 


• 
Move the character/line 
pointer to the end of the buffer. 


• 
Move the character/line 
pointer to a specific line in the buffer. For example, 


the user might be able to request that the character/line 
pointer be moved to 


line number 
11 in the buffer. 


Consider the case where the sample editor command 
to move the character/line 


pointer up or down in the buffer is L. Suppose the buffer appears as follows: 


Pointer_ 
MOV 
ADD 


SHR 
JNC 
TEST 
JZ 


CX,AX 


DX,SP 
DX,1 
EXIT$ST AGE$LEFT 
BX,40H 


DONT$MESS$WITH$BILL 


is entered, 
the buffer would be unchanged; 
however, 
the pointer would point at the 


JZ 
DONT$MESS$WITH$BILL 
instruction. 
If the command 
is 


the 
buffer 
would 
again be unchanged; 
however, 
the 
pointer 
would 
point 
at the 


SHR 
DX,1 instruction. 


The editor should provide the ability to display the buffer on the user's terminal. 


The user should be able to specify the number of characters/lines 
that will be displayed. 
Additional capabilities that might be useful include: 


If the user has a CRT terminal, then it might be useful to display a full screen 
of data automatically. 
In addition, the ability to scroll through the buffer either 


displaying a line at a time or a full screen at a time could also prove useful. 


Consider the case where the sample editor command 
to display a line from the 


buffer on the CRT terminal is T. If the buffer contains: 


IN 
Pointer _ 
CMP 
JNZ 
MOV 


AL. TOUCH$TONE$DECODER$PORT 
AL,COLUMN$4$DIGIT 
TOUCH$TONE$ENCODE 
[DI!.MESSAGE$ST 
ARTED$CODE 


CMP 
AL,COLUMN$4$DIGIT 
JNZ 
TOUCH$TONE$ENCODE 


The editor should provide the ability to search the buffer for the occurrence 
of a 
string 
of characters 
specified 
by the user. 
An additional 
capability 
that would 
be 
extremely useful would be to search all the source code for the occurrence of a particular 
string, For example, when the source code is initially entered, 
typographical errors are 
often present. Using the search facility in combination 
with a change facility allows the 
source code to be corrected with a minimum 
of difficulty. 
Consider the case where the sample editor command 
to search the buffer is S, If 
the buffer appears as follows: 


IN 
Pointer _ 
CMP 


JNZ 
MOV 


AL, TOUCH $TONE$DECODER$PORT 
AL,COLUMN$4$DIGIT 
TOUCH$TONE$ENCODE 
[DI!.MESSAGES$ST 
ARTED$CODE 


is entered, 
the result will depend on whether 
a character 
pointer or a line pointer is 
employed 
by the editor. 
In the case of a line pointer, 
the buffer will be unchanged. 


However, 
the position of the line pointer will be altered as follows: 


IN 
CMP 


Pointer_ 
JNZ 


MOV 


AL, TOUCH$TONE$DECODER$PORT 
AL,COLUMN$4$DIGIT 
TOUCH$TONE$ENCODE 
[DI!.MESSAGE$ST 
ARTED$CODE 


IN 
CMP 


JNZ 
MOV 


AL, TOUCH$TONE$DECODER$PORT 
AL,COLUMN$4$DIGIT 
TOUCH'hONE$ENCODE 
[DI!.MESSAGE$ST 
ARTED$CODE 


The editor must provide the ability to change data in the buffer. The user should 
be able to specify that any string present in the buffer be replaced by a user-specified 
string. The delete facility can be considered a degenerate 
case of the change facility, in 


that the user-specified 
string is a null string. 
Consider the case where the sample editor command to change data in the buffer 


is C stringl :#=string2, where the command functions by locating the next occurrence of 
string 1 in the buffer and replacing it with string 2. If the buffer appears as follows: 


IN 
Pointer- 
CMP 


JNZ 


MOV 


AL. TOUCH$TONE$DECODER$PORT 
AL.COLUMN$4$DIGIT 


TOUCH$TONE$ENCODE 
(DI!.MESSAGE$STARTED$CODE 


IN 


CMP 
Pointer- 
JNZ 


MOV 


AL. TOUC H$TONE$DECODER$PORT 


AL.COLUMN$4$DIGIT 
TOUCH$TONE$ENTRANCE 
[DI!.MESSAGE$ST 
ARTED$CODE 


The editor must provide commands that allow the user to terminate 
the edit ses- 


sion in a reasonable 
manner. 
Reasonable 
termination 
methods 
might include: 


• 
Move all data in the buffer to mass storage. 


• 
Move all unprocessed 
source code through 
the buffer to mass storage. This 
can be considered 
a normal termination 
method. 


• 
Exit the edit immediately 
without flushing the buffer. This method might be 
used 
in the 
case 
where 
unwise 
or 
unfortunate 
user 
manipulation 
has 
catastrophically 
affected the source code. Depending 
upon the form of mass 
storage, the user may be able to restore the source code to its original form. 


The above types of commands 
are necessary components 
for a rudimentary 
edi- 


tor. More sophisticated 
editors would include other capabilities such as: 


1. 
The concatenation 
of individual commands 
into command 
strings. 


2. 
Multiple iterations of command strings. For example, this would be especially 
useful for changing all occurrences 
of a particular string in the source code. 


3. 
More sophisticated 
file-handling. 
In this discussion, 
the concept of files has 
been avoided. The source code has only been discussed in terms of residing 
on mass storage. More advanced editors are typically run from a fast mass 
storage device, e.g., hard-disk, and interface with an operating system which 
provides powerful data file manipulation 
capabilities. 
Indeed, 
some editors 


relieve the user of the responsibility 
of reading and writing from the buffer. 


With these editors, the user can scroll throughout 
the source code without 


worrying about reading/writing 
source code; this function is performed by the 
editor automatically. 


4. 
Arithmetic 
capabilities. 
Some editors 
may also be used as very powerful 


calculators. 


5. 
The ability to extract a certain portion of the buffer and reserve it for later use. 
This ability can be extremely useful if it is necessary to rearrange the source 
code, e.g., 100 lines of source code at the beginning of the file must be moved 
to the middle of the file. 


6. 
The ability to include "ambiguous" 
elements 
in any string operation. 
For 


example, 
the search 
operation 
might use A*CDE to search 
for any five- 


character string where the first character 
is A, the last three characters 
are 


CDE, and the second character may be ambiguous, 
i.e., any character. 


Most assemblers 
perform the following functions: 


• 
Separate assembly language source code into individual statements. 


• 
Break each assembly language statement 
into its component 
parts. These parts 


include labels, assembly language operators, 
assembler 
directives, 
operands 


for the assembly language operators 
and comments. 


• 
Process each of the component 
parts according to the rules of the assembly 
language. From this process, the assembler generates an object code file and a 
symbol table. 


• 
Write files to mass storage. These files would include an object code file, a list- 
ing file (which is comprised of the object code file and the source code file) and 
a symbol table file. 


Separating the source code into the individual assembly language statements 
is a 
fairly easy task since most source code files are organized with one statement 
per line, 
i.e., there is one statement 
between two carriage returns. Some assemblers allow more 


than one statement 
per line; these statements 
are usually separated by a special delimit- 
'ng character which is treated in a similar fashion to the carriage return. 


The function that makes an assembler an extremely useful tool is the processing 


of the individual assembly language statements. 
The individual parts of the assembly 


language statement 
are: 


• 
Labels. There 
mayor 
may not be a label present 
in an assembly language 


instruction. 
If a label is present, it is saved in a symbol table along with the cur- 
rent value of the location counter. 
In more complex assemblers, 
more infor- 


mation may be saved, depending on the type of operator or directive specified. 


• 
Operators. 
Operators 
are either assembly language mnemonics, 
e.g., ADC, 


STD, IN, or assembler 
directives. 
Assembly language mnemonics 
are trans- 


lated by the assembler 
into object code. Some mnemonics 
require operands, 


for the precise object code generated will depend on other information 
present 


in the assembly language statement. 
For example, 
the ADC instruction 
can 


generate hundreds 
of different object codes, but the ADC 
AX,DX instruc- 


tion generates 
a unique object code. 


Assembler directives are used to control several functions the assembler employs 
to generate the object code file and the listing file. These directives control: 


• 
The location 
at which the source code is assembled. 
Programs 
which will 


include absolute 
addresses 
for program memory 
location must be aware of 
where they will reside in memory. Consider the case where the sample assem- 
bler's location specifying directive is ORG. If the assembler directive 


is included in the source code, the assembly language statements 
succeeding 


this one will be assembled 
assuming 
that the program counter 
was set to 
0400H. 


The program's starting address. This is typically saved in the object code. Most 
assemblers allow the starting address of the program to be specified in the last 
statement 
of the source code, the END statement. 
Assuming that the source 
assembler uses the END statement 
to specify the starting address, if the state- 
ment 


is the last source code statement, 
then the assembler 
will generate an object 
code that includes 
the address 
of START$OF$PROG RAM as the starting 


address. 


• 
The format of the listing file. Directives which control the listing file format 
could control the pagination, whether or not certain sections of the source and 
object code are included in the listing file and the various headings associated 
with the listing file. 


• 
The initial values for data memory locations. 


• 
Operands. Operands, for assembly language mnemonics 
or assembly language 
directives that require operands, 
can appear in a variety of different forms: 


Register names 
Numbers 
(in one of a number of different bases) 


Variable names 
Labels 
Strings of ASCII characters 
Expressions 
(a combination 
of any of the above in conjunction 
with arith- 
metic or logical operators) 


• 
Comments. 
Comments 
are used to explain the operation of the program. They 
are ignored by the assembler, 
but are essential to any user who is interested 
in 
modifying the program. 


The translation 
process that the assembler 
performs 
is a fairly straightforward 
task. When the statement 
has been broken into its constituent 
parts, the constituent 
parts are mapped into series of tables which produce pieces of the object code. These 
pieces are then assembled 
into the final object code. 


A debugger is a development 
tool that is used to assist in removing errors from 
object programs. Debuggers, 
like editors and assemblers, 
vary in complexity. The most 
elementary 
debuggers contain elements 
that allow the user to: 


• 
Control Execution 


• 
Display register/memory 


Debuggers allow the user to control execution 
by providing facilities such as: 


• 
A single step facility. A single step facility allows the user to execute the object 
code one instruction 
at a time. The user is able to view the registers/memory 
between each instruction's 
execution, 
which, hopefully, 
is sufficient to allow 
the user to guarantee that the instructions are performing the desired function. 
More complex debuggers contain sophisticated 
forms of the single step routine 
which allow the user to specify the exact number 
of instructions 
to be per- 
formed and to specify the registers or memory locations that will be displayed 
following each instruction's 
execution. 


A breakpoint facility. A breakpoint facility allows the user to control execution 
by placing a special code, or in the case of the 8086, a software interrupt 
instruction, 
into the object code at locations specified by the user. When the 
special code is executed, 
it results in a transfer 
of control to the debugger, 


thereby 
halting the execution 
of the user's 
object code. At this point, 
the 
debugger replaces the original object code at the location modified to contain 
the special code and allows the user to view the state of the machine. 


Debuggers 
typically allow the user to display the contents 
of any portion 
of 
memory and the contents of the CPU's internal registers, thus allowing a complete view 
of the state of the machine. 


More complex debuggers allow the user to: 


• 
Alter memory/register 
contents 


• 
Trace object code execution 


• 
Assemble/disassemble 
object code 


• 
Read/write 
from mass storage 


• 
Perform simple arithmetic functions 


• 
Use more sophisticated 
breakpoints 


• 
Manipulate 
the symbol table 


Typical debuggers 
provide several facilities toward the alteration 
of memory/ 
register contents. 
These would include: 


Examine and optionally alter a memory location 


• 
Fill a sequence of memory locations with a constant 


• 
Move the contents of a block of memory locations to another block of memory 
locations. 


Consider the case where the sample debugger command to fill memory locations 


with a constant 
is F addrb 
addr2, constant. 
This command 
would fill all the memory 


locations from addrl 
to addr2 
(inclusive) 
with constant. 
For example, 
if the debugger 
command 


were entered, 
the debugger would enter the constant 
2016 in all locations from 10016 to 
17F16• 


Debuggers that allow the user to trace the execution 
of object code usually imple- 
ment this function as an extension of the single step facility. The user typically specifies 
the number of steps to be executed and the kind of information 
to be displayed, thereby 
allowing the user to watch (trace) the program's 
execution. 
Some debuggers provide a more sophisticated 
form of memory display in which 
the user can specify that the memory 
contents 
be displayed as assembly 
language 
instructions 
rather than hexadecimal 
numbers. 
For example, instead of 


D 
400A05 


would display memory locations 400-405 as: 


400 
IN AL.10 
402 
AND ALAO 


404 
JZ 400 


In addition, some debuggers provide a basic assembler 
facility that the user may 


employ to alter the contents 
of memory. Instead of substituting 
object code, 


5404 
75. 


405 
FA 


the user may substitute 
source code, 


A404 
JNZ 
400 


A debugger 
often provides an elementary 
ability to read and write data to and 
from mass storage. Typical capabilities include: 


• 
Read an object code file from mass storage to a series of memory 
locations 
specified by the user. 


• 
Write an object code file from memory locations specified by the user to mass 
storage. 


• 
Basic paper tape handling facilities. 


Most debuggers provide hexadecimal arithmetic 
facilities. The user is commonly 
able to enter two hexadecimal numbers and have the debugger calculate and display the 
sum and difference of these two numbers. 
Debuggers that possess more sophisticated 
breakpoint 
facilities usually provide: 


• 
A pass count for each breakpoint. 
Each time an instruction 
is fetched from the 
breakpoint 
address, 
the pass count 
is decremented. 
When the pass count 
reaches zero, the user program is suspended 
and control is returned 
to the 
debugger. 
The user may then view the state of the machine. This feature is 
particularly useful when debugging program loops. If, for example, the fifty- 
third iteration of the loop seems to be causing some difficulty, a breakpoint 
in 
the loop can be established with a pass count of fifty-three, and the offending 
iteration can be viewed at the user's leisure. If a pass count capability is not 
available, stopping the execution 
of the user program during the fifty-third 
iteration is not a trivial task. 


• 
A hardware breakpoint facility that suspends the execution of the user's object 
code when a memory location is accessed for data. This is very useful for occa- 
sions when memory locations are being trashed in an unpredictable 
fashion. 
Breaking on a memory 
access typically allows for the identification 
of the 
source of the problem. 


Sophisticated debuggers often work with a symbol table created by the assembler. 


These debuggers allow the user to reference memory locations by name. This feature is 
very helpful 
when working with relocatable 
object code. Instead 
of calculating 
the 
address 
of a particular 
variable using a load map and a listing, the variable can be 
referenced directly by name. 


Examples 
of 8086 
Assembly 
Language Programming 


This chapter presents two examples of 8086 assembly language programming: 
a 
sort program and an I/O driver. The specification and program design efforts for these 
examples were presented 
earlier in Chapter 2. 


The sort program is divided into three separate modules. These are: 


Read the tape 


Sort the records 


Move subsort to temp 
Compare keys 
Compute pointer 
Move record 


By reviewing the source code, it becomes obvious that not all the subroutine 
calls 
are necessary. For example, the "Read tape record" 
routine is called by only one state- 
ment in the source code. But the source code module is clearer than it would be if the 
entire "Read tape record" routine were included in the source code at the point at which 
is was called. 


The sort program has two abnormal exits, Both of these exits occur in the read/ 
write drivers for the tape controller. For the purposes of this program it is assumed that 
an operating 
system 
is present, 
and that 
this operating 
system 
will display error 
messages at an appropriate 
device, should an anomaly be detected. 


;EQUATES 
FOR SORT ROUTINE 


TAPE$COMMAND$PORT 
EQU 
20H 
;ARBITRARY 
*'S 
TYPICALLY 


TAPE$STATUS$PORT 
EQU 
20H 
;THESE *'S 
WOULD 
BE LISTED 
TAPE$DATA$PORT 
EQU 
22H 
;IN THE SPECIFICATION. 


READ$T APE$COMMAND 
EQU 
OlH 
;FROM SPECIFICATION 


WRITE$T APE$COMMAND 
EQU 
02H 


OPERATION$COMPLETE$FLAG 
EQU 
04H 
TAPE$ERROR$STATUS 
EQU 
OBOH 


TAPE$ERROR$FLAG 
EQU 
044H 
;USED BY SYSTEM 


;EXTERNAL REFERENCES 


EXTRN SYSTEM: 
FAR 


EXTRN SYSTEM$ERROR. 
FAR 


DATA 
SEGMENT 


,RAM LOCATIONS 
FOR SORT PROGRAM 


RECORD$TEMP 
DB 
2 
KEY$TEMP 
DB 
10 


INDEX 
DW 
1 
INTERVAL 
DW 
1 


SUBSORT$COUNTER 
DW 
1 
RECORD$COUNT 
DW 
1 


TAPE$BUFFER 
DB 
140 DUPP! 


SORT$AREA 
DB 
4000 
DUP (12 DUPIO)) 
DATA 
ENDS 


CODE SEGMENT 
ASSUME 
CS: CODE, DS 
DATA, 
ES: DATA 


MAIN. 


MOV 
AX,DATA 
;LOAD SEGMENT REGISTERS 


MOV 
DS,AX 


MOV 
ES,AX 


MOV 
RECORD$COUNT,O 
;SET * 
OF RECORDS TO 0 


MOV 
DI,OFFSET SORT$AREA 
;POINT DI AT SORT AREA 


, READ THE TAPE MODULE OPERATES BY 
,1 
READING A TAPE RECORD 


,2 
CHECKING FOR DONE 


,3 
MOVING 6 WORDS 
FROM THE TAPE BUFFER TO THE SORT AREA 


,4 
UPDATING THE * 
OF RECORDS 


READ$THE$TAPE 
CALL 
READ$TAPE$BUFFER 
;READ 12B BYTES 


MOV 
SI.OFFSET TAPE$BUFFER 
,TEST FOR EOF RECORD 


CMP 
ISII,OFFFFH 
,GO SORT IF EOF 
JZ 
SORT 
;NOT EOF, MOVE RECORD # 


MOV 
CX,12 
;AND KEY 


REP 
MOVS TAPE$BUFFER, 
,INCREMENT # OF RECORDS 


SORT$AREA 
INC 
RECORD$COUNT 
.GET ANOTHER BYTE 


JMP 
READ$THE$T APE 


is used to generate object code that will load the address of TAPE$BUFFER 
into the SI 
register as immediate 
data. Note that the statement 


MOV 
SI.TAPE$BUFFER 


would generate object code that will load the contents 
of TAPE$BUFFER 
into the SI 
register. The OFFSET operator is a feature of the standard Intel 8086 assembler; it is not 
a characteristic 
of the 8086 microprocessor. 


Note that moving an even number of bytes is easier than moving an odd number 
of bytes. To move an odd number of bytes, two approaches are possible. One approach 
is: 
CX. ODD$NUMBER 
MOVSB 


;LOAD * OF BYTES 
;TO MOVE 


This approach uses the same number 
of object code bytes as is used moving an even 
number 
of bytes; but it takes twice as long to execute. 
Another 
method is: 


MOV 
CX.ODD$NUMBER 
;LOAD * OF WORDS 
TO MOVE 
SHR 
CX,l 
REP 
MOVSW 
MOVSB 
;MOVE LAST BYTE 


This method 
requires one additional byte of object code, but it executes in the same 
amount 
of time as a routine to move an even number 
of bytes. 


THE SORT MODULE IS A STRAIGHTFORWARD 
RENDITION OF THE ALGORITHM 


PRESENTED IN CHAPTER 3. 


SORT: 
;INITIALIZE INTERVAL TO 
;RECORD COUNT 
MOV 
MOV 


SHR 
JZ 


AX.RECORD$COUNT 
INTERVAL. AX 


INTERVAL,l 
WRITE$TO$T 
APE 


AX,RECORD$COUNT 
;SUBSORT 
CTR=RECORD$ 


COUNT- 
INTERVAL 
SUB 
MOV 


INC 
MOV 
CMP 


JG 
CALL 


MOV 
SUB 
MOV 
CALL 
JGE 


AX.INTERVAL 
SUBSORT$COUNTER,AX 


SUBSORT$COUNTER 
AX.SUBSORT$COUNTER 
AX,RECORD$COUNT 
;TEST FOR NEW INTERVAL 


NEW$INTERVAL 
MOVE$SUBSORT$ 
TO$TEMP 
;SAVE CURRENT RECORD 


AX, SUBSORT$COUNTER 
INDEX = SUBSORT 
CTR-INTERV AL 


AX,INTERVAL 
INDEX,AX 
COMPARE$KEYS 
FOUND$THIS$RECORDS 
$SPOT 


MOV 
AX,INDEX 


CALL 
COMPUTE$POINTER 
MOV 
SI,AX 
CALL 
MOVE$RECORD 


MOV 
SUB 


JGE 


MOV 
CALL 
JMP 


AX,INTERVAL 
;INDEX-INTERVAL=INDEX 


INDEX,AX 


TEST$KEYS 


SI,OFFSET RECORD$TEMP 
MOVE$RECORD 
NEXT$SUBSORT$COUNTER 


Note 
that 
the instruction 


SHR 
INTERVAL,1 


is more 
efficient 
than 
the sequence 


MOV 
AX,INTERVAL 
SHR 
AX,1 
MOV 
INTERVAL,AX 


in both 
memory 
usage and time consumption, 
In all cases, 
it is more 
efficient 
to operate 
on 
a 
memory 
location 
directly, 
rather 
than 
bringing 
information 
into 
a 
register, 


manipulating 
the information, 
and then 
returning 
the result 
to a memory 
location, 


WRITE TAPE OPERATES BY 
1. 
INITIALIZING PTRS TO THE TAPE BUFFER AND SORT AREA 


2. 
MOVING 
12 BYTES AT A TIME UNTIL EITHER 


3 
IF 128 BYTES, WRITE A TAPE RECORD 


4 
IF END OF FILE, APPEND AN EOF RECORD, THEN WRITE 
THE LAST TAPE RECORD 


WRITE$TO$TAPE: 
NEXT$TAPE$BUFFER: 
MOVE$NEXT$RECORD 


MOV 
MOV 
MOV 
REP 


CMP 
JL 


SI,OFFSET SORT$AREA 
DI.OFFSET TAPE$BUFFER 
CX,12 
;GET READY TO MOVE 12 BYTES 


MOVS 
TAPE$BUFFER,$SORT$AREA 


DI,OFFSET TAPE$BUFFER + 128 
UPDATE$RECORD$COUNT 


PUSH 
SI 
PUSH 
01 


CALL 
WRITE$T APE$BUFFER 


;TEST FOR MOVED FULL BUFFER 


;SAVE POINTERS 


;WRITE 
128 BYTES TO TAPE 


;RESTORE POINTERS 
POP 
01 


POP 
SI 
MOV 
AX,OFFSET 
TAPE$BUFFER + 128 
;ANY EXTRAS IN END OF TAPE 


SUB 
DI,AX 
;BUFFER 


NOTE. TO FILL 128 
BYTES REQUIRES MOVING 
11 RECORDS OR 11 X 12 = 


132 BYTES INTO TAPE BUFFER 


CX,DI 
DI.OFFSET TAPE$BUFFER 
UPDATE$RECORD$COUNT 
SI 
SI,AX 


MOVSTAPE$BUFFER,TAPE$BUFFER;MOVE 
~XTRAS 
DOWN TO START 


POP 
SI 
;OF TAPE BUFFER 


RECORD$COUNT 
MOVE$NEXT$RECORD 
DI,OFFSET TAPE$BUFFER 
WRITE$EOF 
CX, OFFSET TAPE$BUFFER + 128 
CX,DI 
AL,AL 
STOS TAPE$BUFFER 
WRITE$TAPE$BUFFER 
TAPE$BUFFER,OFFFFH 
WRITE$TAPE$BUFFER 
SYSTEM 


MOV 
MOV 
JZ 
PUSH 
MOV 


REP 


DEC 
JNZ 
CMP 
JZ 
MOV 
SUB 
XOR 
REP 
CALL 
MOV 
CALL 
JMP 


;TEST IF ONE MORE RECORD 
;MUST 
BE WRITTEN 
BEFORE EOF 


;ZERO OUT THE REST OF 
;THE TAPE BUFFER 


;WRITE LAST TAPE RECORD 
;MOVE IN END OF FILE RECORD 
;WRITE EOF RECORD, A 
;RECORD WITH 
FFFF IN 


;THE FIRST TWO 
BYTES 
;END OF PROGRAM 
;RETURN TO SYSTEM 


;PROCEDURES CALLED BY MAIN PROGRAM 


COMPUTE$POINTER 
PROC 
NEAR 
AX HAS INDEX 
RETURN ADDR IS IN AX 


, 
DX IS NOW 0 


MOV 
MUL 
ADD 


RET 


COMPUTE$POINTER 
ENDP 


CX,,2 
CX 
AX,OFFSET 
SORT$AREA 


This module's 
speed can be increased by replacing 


MOV 
CX,'2 
MUL 
cx 


SHL 
AX,' 
SHL 
AX,' 
MOV 
CX,AX 


SHL 
AX,' 
ADD 
AX,CX 


The MOVIMUL sequence 
requires 
126 cycles to execute, The second sequence 
requires 
11 cycles to execute and does not destroy the DX register. The MOV IMUL 
sequence, 
however, 
requires 
only 5 bytes of program 
memory, 
whereas the second 
sequence 
requires 
10 bytes. 


WRITE TAPE BUFFER OPERATES BY 
,. 
POINTING AT THE TAPE BUFFER 


2. 
INITIALIZING THE TAPE CONTROLLER FOR A WRITE 


3. 
CHECKING FOR STATUS 
ERRORS 


4. 
CHECKING FOR OPERATION DONE 


5. 
WRITING TO THE TAPE DATA 
PORT 


WRITE$TAPE$BUFFER 
PROC 
NEAR 
MOV 
SI.OFFSET TAPE$BUFFER 
MOV 
AL,WRITE$TAPE$COMMAND 
OUT 
TAPE$COMMAND$PORT,AL 


AL,TAPE$STATUS$PORT 
;CHECK FOR ERRORS 
AL, TAPE$ERROR$ST ATUS 


OUTPUT$TAPE$ERROR 


AL,OPERATION$COMPLETE$FLAG 
;TEST FOR DONE 


WRITE$COMPLETE 


IN 
TEST 


JNZ 


TEST 


JNZ 


LODSB 
OUT 
JMP 
MOV 
JMP 


WRITE$COMPLETE: 
RET 


WRITE$T APE$BUFFER 
ENDP 


;GET A BYTE 
TAPE$DATA$PORT,AL 
;SHIP IT OUT 
GET$T APE$ST ATUS 
AH, TAPE$ERROR$FLAG 
SYSTEM$ERROR 


TEST 
Al, TAPE$ERROR$ST ATUS 


is used instead of an AN D operation so that the status byte in AL will be preserved for a 
subsequent 
operation, Since the contents of the AL register are not significant after the 


test for operaton completed, 
the 


TEST 
Al,OPERATION$COMPlETE$FlAG 


operation could be replaced by 


AND 
Al,OPERATION$COMPlETE$FlAG 


READ TAPE BUFFER OPERATES BY 
1. INITIALIZING TAPE CONTROllER 
TO READ 


2. CHECKING FOR TAPE ERRORS 
3. CHECKING FOR COMPLETION 
4. READING DATA 
FROM TAPE DATA 
PORT 


PROC 
MOV 
MOV 
OUT 


IN 
TEST 


JNZ 


TEST 
JNZ 


IN 
MOV 
INC 
JMP 


MOV 
JMP 


READ$COMPlETE: 
RET 


READ$TAPE$BUFFER 
ENDP 


NEAR 
SI,OFFSET TAPE$BUFFER 
;POINT AT TAPE BUFFER 


Al,READ$TAPE$COMMAND 
;TEll 
TAPE TO READ 
TAPE$COMMAND$PORT,Al 
Al, TAPE$ST ATUS$PORT 
Al,TAPE$ERROR$STATUS 
;CHECK FOR TAPE ERRORS 


TAPE$ERROR 


Al,OPERATION$COMPlETE$FlAG 
;CHECK FOR DONE 


READ$COMPlETE 


Al,TAPE$DATA$PORT 
;GET DATA 


ISll,Al 
;SAVE DATA 


SI 
GET$STATUS 


AH, TAPE$ERROR$FlAG 
;CAll 
SYSTEM 


SYSTEM$ERROR 
;ERROR PROCESSOR 


Note 
the similarities 
between 
this routine 
and the WRITE$T APE$BUFFER 
routine, There are two differences between the read and write routines. 


WRITE$TAPE$BUFFER: 
MOV 
Al,WRITE$TAPE$COMMAND 


is replaced by 


READ$T APE$BUFFER: 
MOV 
Al,READ$TAPE$COMMAND 


and 


lODSB 
OUT 
Al, TAPE$DAT A$PORT 


is replaced by 


IN 
Al, TAPE$DA TA$PORT 
MOV 
ISll,Al 
INC 
SI 


As an exercise for the reader, meld these two routines together so that the read 


~ 


nd write tape buffer routines share common code. 


Note that it would be more effective to use the DI register as a pointer into the 
ape buffer, e.g., 


MOV 
[SII.AL 
INC 
SI 


STOSB 


f-Iowever, the DI register is used in the main line of code by the READ$THE$T APE 
nodule. 


COMPARE KEYS OPERATES BY COMPARING 
KEY (INDEX) WITH KEYTEMP 


1. CALCULATE 
INDEX 
2. COMPARE KEYS UNTIL 


DIFFERENCE IS FOUND 
10 BYTES HAVE BEEN COMPARED 


PROC 
MOV 
CALL 


INC 
INC 
MOV 
MOV 


MOV 
CMPS 


NEAR 
AX,INDEX 
COMPUTE$POINTER 


AX 
AX 
DI,AX 
SI,OFFSET KEY$TEMP 


CX,OOlO 
KEY$TEMP, 
SORT$AREA 


MOVE RECORD OPERATES BY MOVING WHATEVER 
SI POINTS AT TO THE LOCATIONS 


POINTED TO BY INDEX INTERVAL 
1. 
CALCULATE 
PTA. FOR INDEX + INTERVAL 


2. 
MOVE 12 BYTES 


MOVE$RECORD 
PROC 
MOV 
ADD 


CALL 
MOV 
REP 


NEAR 
AX,INDEX 
AX,INTERVAL 


COMPUTE$POINTER 
DI,AX 
MOVS SORT$AREA,SORT$AREA 


If it is necessary to save time as opposed to memory space, the sequence 


MOV 
DI.AX 
REP 
MOVSB 
RET 


MOV 
DI,AX 
SHR 
CX,1 
REP 
MOVSW 
RET 


Replacing MOVSB with MOVSW saves 6 x 17 = 102 clock periods. Subtract the 
time required 
for the shift instruction, 
2 cycles, and a net savings of 100 cycles is 


realized. 


Note that, for some assemblers, 
specifying the operands determines 
byte or word 
operations 
from the operand types. For our example 


MOVS 
SORT$AREA, 
SORT$AREA 


is a byte operation since SORT$AREA 
is a buffer of bytes, The MOVSB and MOVSW 
forms of MOVS tell the assembler 
what to do, ignoring operand specification and type 
checking. For concise and readable code, the former technique 
is often preferred, 
while 
the latter allows overrides for efficiency as in the above example. 


MOVE SUBSORT 
TO TEMP OPERATES BY: 


1. 
CALCULATING 
SUBSORT 
PTR. 
2. 
LOADING POINTER TO RECORD TEMP. 
3. 
MOVING BYTES 


PROC 
MOV 
CALL 
MOV 
MOV 
REP 


NEAR 
AX,SUBSORT$COUNTER 
COMPUTE$POINTER 
SI.AX 
DI.OFFSET RECORD$TEMP 
MOVSB 
;COMPUTE 
POINTER RETURNS 


;CX = 12 


As with the previous 
module, 
the time required 
to execute 
this code can be 
decreased 
by replacing the sequence 


The coding of the I/O driver program poses two basic problems: 


• 
How should these routines be called by external routines wishing to use them? 


How will the parameters 
be passed to these routines? 


These routines can be called in one of three basic ways: 


Call the operating system, 
which will pass the request on to the appropriate 
module 


Call a command 
handler for the entire driver, 
which will screen parameters 
and then pass control to the appropriate 
module 


• 
Call each of the routines 
directly, 
using an address 
present 
in the calling 
routine's 
code. 


In this example we will assume that the routines 
are called directly; the calling 
program knows the entry address for each routine. This does not preclude the use of the 
first two methods; 
if either of these methods is preferable, 
a simple dispatch table that 
points to each of the modules would allow the operating system or command handler to 
distribute 
the command 
to the appropriate 
module. 
As mentioned 
in Chapter 2, these are the three methods used to pass parameters: 


• 
In the registers 


• 
In a task block 


• 
On the stack. 


In this example, the parameters will be passed in the registers, except in the case 
of multiple character input and output routines. For these functions task blocks will be 
defined. Multiple character input uses the following task block: 


Byte *0: 
1 
1 Maximum number of characters to read 


Byte * 1: 
1 Number of bytes actually read 
I} Bytes 2 through n contain the 
information read by the multiple 
-----1 
character input routine 


Byte *n: 
1 
, 


____ 
.. 1 Number of bytes to output 


I} Bytes 1 through n are sent to the 
channels data port 
1"""---1 
Byte *n: 
, 


CONTROL$PORT 
EQU 
12H 


STATUS$PORT 
EQU 
12H 


DATA$PORT 
EQU 
10H 


IF BIT 0 OF THE AH REGISTER IS 1, THE USER HAS LOADED SI WITH A 
POINTER TO THE STRING TO BE SENT TO THE CONTROL PORT. IF BIT 0 IS A 0, A 
STANDARD 
INITIALIZATION 
STRING WILL BE SENT 


USER$INITIALIZATION$BIT 
EQU 
01 H 


TIMEOUT$VALUE 
EQU 
OFOOOH 


; 
BITS 3, 4, AND 5 OF THE SIO STATUS 
BYTE ARE ERROR BITS 


SIO$ERRORS 
EQU 
38H 


; 
BIT 1 INDICATES WHETHER OR NOT THE RECEIVER IS READY 


; 
BIT 0 INDICATES WHETHER OR NOT THE TRANSMITTER 
IS READY 


SIO$RECEIVER$READY 
EQU 
02H 


SIO$TRANNY$READY 
EQU 
01 H 


TIMEOUT$ERROR$FLAG 
EQU 
OFFH 


.; 
CARRIAGE RETURN IS TERMINATION 
CHARACTER 
FOR READ 


CARRIAGE$RETURN 
EQU 
ODH 


; 
'$' 
IS TERMINATION 
CHARACTER 
FOR WRITE 


TERMINATION$CHARACTER 
EQU 
24H 


EXTRN SYSTEM$ERROR: 
FAR 


CODE SEGMENT 
ASSUME 
CS: 
CODE 


THE INITIALIZATION 
OPERATES BY: 


1. 
TESTING FOR USER SPECIFIED OR 
SYSTEM 
INITIALIZATION 
STRING 


2. 
SENDING THE STRING TO THE CONTROL PORT, 
TERMINATING 
WHEN A 0 IS DETECTED 


INITIALIZATION 
PROC 
AND 
JNZ 
MOV 
SI$LOADED$BY$USER: 
LODSB 
OR 
JZ 
OUT 
JMP 


PORT$INITIALIZATION$STRING 
DO$A$RETURN: 
INITIALIZA TION 


NEAR 
AH,USER$INITIALIZATION$BIT 
;TEST FOR USER INIT 


SI$LOADED$BY$USER 
SI.OFFSET PORT$INITIALIZATION$STRING 
;LOAD STANDARD 
STRING 


AL,AL 
;SET FLAGS TO TEST FOR 0 
DO$A$RETURN 
;EXIT IF 0 
CONTROL$PORT,AL 
SI$LOADED$BY$USER 


DB 
OCEHAOH,OCEH,37H,00H 
RET 
ENDP 


A four-byte initialization 
string is necessary to allow for the fact that the 8251 is 


not in a known state when this routine is called. If, for example, a two-byte sequence 


CE16 
Mode 


3716 
Command 


were sent to the 8251, the 8251 would not be correctly initialized, since it could have 
been waiting for a Command 
Control Input. If a three-byte 
sequence 


401 6 
Command 
(Resed 
CF16 
Mode 
3716 
Command 


were sent, the 8251 would not be correctly initialized if it happened to be waiting for a 
Mode Control1nput. 
The four-byte sequence, however, will correctly initialize the 8251 
regardless of its prior state. 


SINGLE CHARACTER 
INPUT OPERATES BY: 


1. 
LOADING TIMEOUT 
VALUE 


2. 
READING THE STATUS 
PORT AND TESTING FOR SIO ERRORS 


3. 
CHECKING FOR TIMEOUT 
ERRORS 


4. 
READING THE DATA 


IF ZFLAG IS 1 ON RETURN - ERROR CONDITION 
IF ZFLAG IS 0 ON RETURN - NORMAL 
OPERATION 
ERROR CONDITIONS 
RETURNED IN AH 


INPUT$TIMEOUT$ERROR$ 
RETURN: 


SINGLE$CHARACTER$INP 
UT 


PROC 
MOV 


IN 
TEST 


JNZ 
DEC 
JZ 
AND 
JZ 


IN 
RET 


MOV 
XOR 
RET 


MOV 
RET 


ENDP 


NEAR 
CX, TIMEOUT$VALUE 


AL,STATUS$PORT 
;READ STATUS 
AL,SIO$ERRORS 
;CHECK FOR ERRORS 


INPUT$ERROR$RETURN 
CX 
;CHECK FOR TIMEOUT 
INPUT $TIMEOUT$ERROR$RETURN 
AL,SIO$RECEIVER$READY 
;RECEIVER READY? 


TEST$ST ATUS 


AL,DATA$PORT 
;GET VALUE 


AH,AL 
AL,AL 


;SAVE STATUS 
;SET ZERO FLAG 


DEC 
CX 
JZ 
INPUT$TIMEOUT$ERROR$RETURN 
AND 
AL,SIO$RECEIVER$READY 


LOOP 
MOV 
RET 


AND 


NO$TIMEOUT 
AH,OFFH 


IJ his would result in shorter and faster object code. However, it would sacrifice source 
code clarity. 


SINGLE CHARACTER 
OUTPUT 
OPERATES BY: 


1. 
LOADING TIMEOUT 
VALUE 


2. 
READING STATUS 
PORT 


3. 
CHECKING FOR TIMEOUT 
ERROR 


4. 
SENDING DATA 
TO OUTPUT 
PORT IF TRANSMITTER 
IS READY 


THIS ROUTINE USES AX, CX, AND DH 


SINGLE$CHARACTER$OUTPUT 
PROC 
MOV 
MOV 


TRANNY$READY: 
IN 
TEST 
JNZ 
DEC 
JZ 
AND 
JZ 


MOV 
OUT 
RET 


OUTPUT$TIMEOUT$ERROR$ 
MOV 


RETURN: 
RET 


OUTPUT$ERROR$RETURN: 
MOV 
XOR 
RET 


SINGLE$CHARACTER$OUTPUT 
ENDP 


NEAR 
CX, TIMEOUT$VALUE 
DH,AL 
AL,STATUS$PORT 
AL,SIO$ERRORS 
OUTPUT$ERROR$RETURN 
CX 
;TEST FOR TIMEOUT 


OUTP UT$TIMEOUT$ERROR$RETURN 
AL,SIO$TRANNY$READY 
;CHECK FOR TRANSMITTER 
READY 


TRANNY$READY 


AL,DH 
GET DATA 
FROM DH 


DATA$PORT,AL 


AH,AL 
AL,AL 


As with the previous module, 
the timeout error return could be included in the 
mainline code, This would be accomplished 
by replacing 


DEC 
cx 


JZ 
OUTPUT$TIMEOUT$ERROR$RETURN 
AND 
AL,SIO$TRANNY$READY 


LOOP 
NO$TIMEOUT 
MOV 
AH,OFFH 
RET 


AND 
AL,SIO$TRANNY$READY 


( HECK$CHANNEL$ST 
A TUS 
PROC 
NEAR 
IN 
AL,ST ATUS$PORT 
;READ 
RET 


qHECK$CHANNEL$STATUS 
ENDP 


END$CONTROL$INFORMA 
TION 
PROC 
NEAR 


OUT 
AL,CONTROL$PORT 
;WRITE 
RET 


END$CONTROL$INFORMATION 
ENDP 


MUL TIPLE CHARACTER 
INPUT OPERATES BY: 
1. 
GETTING 
'**' OF BYTES TO READ 


2. 
CALLING SINGLE CHARACTER 
INPUT UNTIL 


ERROR FROM SINGLE CHAR 
THE MAXIMUM 
'**' OF CHARACTERS 
HAVE BEEN ENTERED 


A TERMINATION 
CHARACTER 
ICARRIAGE RETURN) IS ENTERED 


THIS ROUTINE IS CALLED WITH SI POINTING AT THE TASK 
BLOCK 
f" '0",'" "'" ".0'.". " 
ULTIPLE$CHARACTER$INPUT 
PROC 
NEAR 
LODSB 
OR 
AL,AL 
;LOAD MAX 
'**' OF BYTES TO READ 


JZ 
ZERO$COUNT$THEN$RETURN 
MOV 
DL,AL 
;SAVE MAX 
'**' IN DL 
MOV 
DI,SI 
INC 
DI 
;POINT AT BUFFER 


9ET$A$CHARACTER: 
CALL 
SINGLE$CHARACTER$INP 
UT 
;GET CHARACTER 


JZ 
INPUT$ERROR 


STOSB 


I 
INC 
BYTE PRT [SII 
;INCREMENT 
'**' READ 
CMP 
DL,[SI) 
;TEST FOR READ MAXIMUM 
'**' 


I 


JZ 
ZERO$COUNT$THEN$RETURN 


CMP 
AL,CARRIAGE$RETURN 
JNZ 
GET$A$CHARACTER 


ZERO$COUNT$THEN$RETURN: 
RET 


~PUT$ERROR: 
JMP 
SYSTEM$ERROR 


IULTIPLE$CHARACTER$INPUT 
ENDP 


T*e sequence 


OR 
AL,AL 


JZ 
ZERO$COUNT$THEN$RETURN 


c ecks for a zero number 
of bytes to be read. 
DI is used as the pointer into the input buffer because of the string primitive, 
S aSH, which saves the data and increments 
the pointer 
in one instruction. 
This 
as urnes that the Direction Flag is set correctly. 


MULTIPLE CHARACTER 
OUTPUT 
OPERATES BY: 


1. 
GETTING THE NUMBER OF CHARACTERS 
TO WRITE 


2. 
CALLING SINGLE CHARACTER 
OUTPUT 
UNTIL 


• 
ERROR FROM SINGLE CHARACTER 
OUTPUT 


• 
THE MAXIMUM 
'*" OF CHARACTERS 
HAVE BEEN WRITTEN 


THIS ROUTINE USES AX. SI. DX. AND CX 


MUL TIPLE$CHARACTER$OUTPUT 
I PROC 
LODSB 
OR 
JZ 
MOV 


LODSB 
CMP 
JZ 


CALL 
JZ 
DEC 
JNZ 


DO$RETURN: 
RET 


OUTPUT$ERROR: 
JMP 


MUL TIPLE$CHARACTER$OUTPUT 
ENDP 


AL.AL 
DO$RETURN 
DL.AL 


AL, TERMINATION$CHARACTER 
DO$RETURN 


SINGLE$CHARACTER$OUTPUT 
OUTPUT$ERROR 
DL 
OUTPUT$A$CHARACTER 


8086 
Microprocessor 
Description 


This is the first chapter devoted to methods of interfacing the 8086 with external 
logic. Therefore 
this chapter 
will examine 
the signals that the 8086 produces 
and 
receives, and will look, in overview, at 8086 system concepts. 


8086 CPU pins and signals are illustrated 
in Figure 7-1. With the exception 
of 
those signals that are specifically labelled, all inputs and outputs are TTL-Ievel compati- 
ble. 
All microprocessors 
produce or receive the following kinds of signals: 


• 
Address Lines 
Data Lines 
• 
Control and Status Lines 


• 
Power and Timing Lines 


The 8086's 40-pin package has all four types of signals. Some pins carry more than 
one type of information. 
For example, the data and address lines are multiplexed. 
Other 
pins have their functions defined by the level present at the MN/MX 
pin. 
The discussion that follows will describe the function of each pin when MN/MX is 
high or low. 
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VCC 
AD15 
A16/S3 
A 17/S4 
A 18/S5 
A19/S6 
BHE/S7 
MN/MX 
RD 
RO/GTO, HOLD 
RO/GT1, HLDA 
LOCK, WR 
S2, M/IO 
ST, DT/A 
SO, DEN 
OSO, ALE 
OSl, 
INTA 
TEST 
READY 
RESET 


Pin Name 
Description 
Type 


ADO-AD15 
Data/ Address bus 
Bidirectional, tristate 
A 16/S3, A 17/S4 
Address/Segment 
identifier 
Output, tristate 


A18/S5 
Address/Interrupt 
enable status 
Output, tristate 


A19/S6 
Address/status 
Output, tristate 


BHE/S7 
High-order byte/status 
Output, tristate 
RD 
Read control 
Output, tristate 
READY 
Wait state request 
Input 
TEST 
Wait for test control 
Input 
INTR 
Interrupt request 
Input 
NMI 
Non-maskable interrupt request 
Input 
RESET 
System Reset 
Input 


CLK 
System Clock 
Input 


MN/MX 
= GND for a maximum system 


sO,5i 
S2 
Machine cycle status 
Output, tristate 


RO/GTO, RO/GT1 
Local bus priority control 
Bidirectional 


OSO,OS1 
Instruction queue status 
Output 


LOCK 
Bus hold control 
Output, tristate 


MN/MX 
= VCC for a minimum system 
M/IO 
Memory or I/O access 
Output, tristate 
WR 
Write control 
Output, tristate 


ALE 
Address Latch enable 
Output 
DT/R 
Data transmitfreceive 
Output, tristate 
DEN 
Data enable 
Output, tristate 


INTA 
Interrupt acknowledge 
Output, tristate 


HOLD 
Hold request 
Input 
HLDA 
Hold acknowledge 
Output 


VCC,GND 
Power, ground 


D Maximum System Signals D Minimum System Signals 


The 8086 CPU can directly address one million (l M) bytes of memory - 
which 
means that 20 bits of address information 
are necessary. 
The 8086 CPU accesses data in 16-bit units, treated as a low-order 
byte and a 
high~order byte. 


To allow for a 20-bit address bus and a 16-bit data bus on a 40-pin package, the 
data bus is multiplexed 
with the least significant 
16 bits of the address bus. The four 
additional address lines are multiplexed 
with status information. 
The address/data 
and 
address/status 
signals are: 


ADO-AD15 
A16/S3 
A17/S4 
A18/S5 
A19/S6 
M/S7 


Let us examine these lines in detail. 
ADO-AD15. These 16 lines are the multiplexed 
address bus/data 
bus lines. Dur- 
ing the first clock period of a bus cycle, these lines contain the low-order 16 address bits. 
During all other clock cycles, these lines are used as the data bus. These lines are put 
into the high impedance state when the 8086 is performing 
an interrupt 
acknowledge 
cycle or a "hold acknowledge" 
cycle. 


A16/53. 
During the first clock period of an instruction's 
execution, 
this line 
serves as address line 16. If an I/O instruction 
is performed, 
this line is low during the 
first clock period. During all other clock periods, this line is used in conjunction with line 
A 17/S4 to provide status information. 
A17/54. During 
the first clock period of an instruction's 
execution, 
this line 
serves as address line 17. If an I/O instruction 
is performed, 
this line is low during the 
first clock period. During other cycles, this line is used in conjunction 
with line A16/S3 
to provide status information. 


During all clock periods but the first, A 16/S3 and A 17/S4 provide information 
that specifies which segment 
register is producing 
the segment 
portion of the 8086 
address, as follows: 


A17/54 
A16/53 
Meaning 


0 
0 
Extra Segment 


0 
1 
Stack segment 
1 
0 
Code segment or no segment 
1 
1 
Data segment 


This information 
can be used by external logic to expand the 8086 memory space 
so that each segment register addresses its own unique 1M byte of memory, but it would 
then be impossible to overlap memory addresses computed 
in different segments. 


A18/55. 
During 
the first clock period of an instruction's 
execution, 
this line 
erves as address line 18. If an I/O instruction is executed, this line is low during the first 
lock period. During all other clock periods, this line reflects the state of the 8086's 
Interrupt 
Enable flag. 


A19/56. 
During the first clock period of an instruction's 
execution, 
this line 
serves as address line 19. If an I/O instruction 
is performed, 
this line is low during the 


first clock period. During all other cycles, the 8086 holds this line low if it is controlling 
the system bus. During a "hold acknowledge" 
clock period, the 8086 will float this line, 


allowing another 
bus master to take control of the system bus. 
BHE/S7. During the first clock period of an instruction's 
execution, 
this line is 


used as BHE. BHE is held low during read, write, and interrupt acknowledge sequences 
in which data is to be transferred 
on the high-order eight bits of the data bus. This signal 


is used in conjunction 
with the ADO line to generate select logic for memory banks. A 


more extensive discussion of 8086 memory selection is present in the next section. Dur- 
ing the second and subsequent 
clock periods BHE/S7 maintains 
its first clock period 


output level. 


8086 control and status lines can be divided into two categories: 
those which are 


not affected by the level at the MN/MX pin and those whose function depends on the 
value at the MN/MX 
pin. Those which are not affected include: 


RD 
READY 
TEST 
INTR 
NMI 
RESET 


RD is output low when the CPU is reading data from a memory location or an I/O 


device. The S2-(M/IO) 
pin specifies whether a memory or I/O access is requested. 


READY is used by a selected memory or I/O device to indicate that it is ready to 


accomplish the data transfer operation. A signal (RDY 1 or RDY2) is input to the 8284 
Clock Generator, 
which then synchronizes the READY input with the clock. If READY 
is input low at the appropriate 
time, then the 8086 will execute 
"Wait" 
states until 


READY is raised high. 
TEST is an input that is only used by the 8086 WAIT instruction. When the WAIT 
instruction 
is executed, 
the 8086 will pause until TEST is input low. 


INTR is an interrupt request input. This signal is sampled by the 8086 during the 
final clock period of each instruction's 
execution. 
If the interrupt 
enable bit is a 1 and 


INTR is high, then the 8086 will execute 
an interrupt 
acknowledge 
sequence 
and 


transfer control to the appropriate interrupt service routine. Otherwise the next instruc- 
tion will be executed. 
INTR is a level triggered input. 


NMI is a non-maskable 
interrupt 
request input. NMI is an edge triggered input. 


Should NMI go from low to high, the 8086 will complete 
execution 
of the current 


instruction, 
and then transfer control to a non-maskable 
interrupt service routine. The 
address of the non-maskable 
interrupt 
service routine is present at memory 
location 


0000816, Software may not disable this interrupt. 


RESET is a system reset signal; it must be input high to the 8284 Clock Generator 


for at least four CLK clock periods, except on power-up, when RESET must last at least 
50J.l.s.The 8284 synchronizes 
RESET and re-transmits 
it to the 8086. When the RESET 
returns to low, the following events occur: 


I. 
The Flags register is set to 000016, This has the effect of disabling interrupts 
and single stepping mode 


2. 
The OS, SS, ES and PC registers are reset to 000016 
3. 
The CS register is set to FFFFI6 


Execution will continue from memory location FFFFI6• 
The signals that are affected by the MN/MX 
pin include: 


SO-(imJ) 
51-(OT/R) 
S2-(M/iQ) 
RQ/GTO-IHOLO) 
RQ/G'fT-IHLOA) 
QSO-IALE) 
QS1-IINTA) 
LOCK-(WR) 


When MN/MX is grounded, 
the 8086 is said to be in "maximum 
mode." 
When 
MN/MX 
is high the 8086 is said to be in "minimum 
mode." 
SO-IOEN). 
If the MN/MX 
pin is grounded, 
this pin functions as SO. SO is used 
with SI-(DT/R) 
and the S2-(MIIO) to provide status information. 
This status informa- 
tion is discussed 
following the description 
of S2- (M/TO). If the MN/MX 
pin level is 
high, this pin functions as DEN. DEN is used to control the 8286/8287 buffers by enab- 
ling the buffers' data transceivers 
onto the system or local bus (as determined 
by DT/ 
R). 


S1-10T/R). If the MN/MX 
pin is grounded, 
this pin functions as SI. Sf is used 
with SO-(DEN) and the S2-(M/TO) to provide status information. 
This information 
is 
discussed following the description of S2-(M/ID). 
If the MN/MX pin level is high, this 
pin functions as DT/R. DT/R is used to control the 8286/8287 buffers, signalling the 
direction of the data transfer. If DT/R is high, the transceivers 
place data on the system 
bus; if DT/R is low, the transceivers 
take data off the system bus. The 8288 Bus Con- 
troller also generates DEN and DT/R outputs. If the Bus Controller is present, its DEN 
and DT/R outputs are used instead of 8086 DEN and DT/R outputs. We describe these 
different configurations 
later. 


I 
S2-IM/IOl. 
If the MN/MX 
pin is grounded, 
this pin functions as S2. S2 is used 
with SO-(DEN) and SI-(DT/R) 
pin to provide the status information 
described below. 


If the MN/MX 
pin level is high, this pin functions as MilO. During a memory or I/O 
access MilO is high for a memory access and low for an I/O access. 


If the MN/MX 
pin is grounded, 
status 
is provided 
to the 8288 Bus Controller 
via 


SO, 51, and S2 as follows: 


52 
51 
so 


0 
0 
0 
Interrupt Acknowledge 
0 
0 
, 
I/O Read 
0 
1 
0 
I/O Write 
0 
1 
1 
Halt 
1 
0 
0 
Instruction Fetch 
1 
0 
1 
Memory Read 
1 
1 
0 
Memory Write 
1 
1 
1 
Inactive 


This information 
is used by the 8288 Bus Controller 
to generate 
memory 
and I/O 
control 
signals 
for a maximum 
mode 
system. 


QSO-(ALE). If the MN/MX 
pin is grounded, 
this pin functions 
as QSO. QSO is used 
with 
the 
QS 1- (INT A) 
pin to provide 
8086 
instruction 
queue 
status. 
The 
instruction 


queue, 
which 
we 
describe 
later 
in 
detail, 
is 
a 
six- byte 
space 
within 
the 
8086 


microprocessor; 
it is used 
to hold 
object 
code bytes 
awaiting 
execution. 
If the MN/MX 


pin level 
is high, 
QSO-(ALE) 
functions 
as ALE. 
A high 
ALE 
pulse 
is output 
while 
a 
valid memory 
address 
is present 
on the address/data 
bus. In a maximum 
mode 
system, 
ALE 
is provided 
by the 
8288 Bus Controller. 


QS1-(INTAI. 
If the 
MN/MX 
pin is grounded, 
this 
pin functions 
as QS!. 
QSI 
is 


used 
with QSO-(ALE) 
to provide 
8086 instruction 
queue 
status, 
as described 
below. 
If 


the MN/MX 
pin level is high, 
this pin functions 
as INTA. 
INTA 
is output 
low while the 
8086 is executing 
an interrupt 
acknowledge 
sequence. 
In a maximum 
system 
INT A is 


provided 
by the 
8288 Bus Controller. 


If the 
MN/MX 
pin is grounded, 
8086's 
instruction 
queue 
status 
is provided 
by 
QSO and QS I as follows: 


No operation 
The first byte of an instruction is being 
executed 
The queue is being emptied 
A subsequent instruction byte is being 
taken from the queue 


QSO and QS 1 are valid during 
the clock period 
that 
follows 
any queue 
operation. 


RQ/GTO-(HOLDI. 
If the MN/MX 
pin is grounded, 
this pin functions 
as RQ/GTO. 


RQ/GTO 
is a request/grant 
line. Other 
bus masters 
may force the 8086 to enter 
a HOLD 
state 
by inputting 
a low pulse at this pin. The 8086 will acknowledge 
that it is entering 
a 


HOLD 
state 
by outputting 
a low pulse 
via RQ/GTO 
to the 
requesting 
bus master. 
The 
8086 will then 
relinquish 
control 
of the system 
bus and three-state 
outputs. 
When 
the 


new bus master 
subsequently 
relinquishes 
control 
of the system 
bus, it does so by send- 
ing another 
low RQ/GTO 
pulse. 
The 
8086 
then 
reasserts 
bus 
control. 
Request/grant 
sequences 
are described 
in detail 
in Chapter 
8. If the 
MN/MX 
pin level 
is high, 
RQ/ 
GTO-(HOLD) 
functions 
as HOLD. 
HOLD 
is used 
as a HOLD 
request 
line by external 


logic. When 
external 
logic sets the HOLD 
level high, 
the 8086 will enter 
a HOLD 
state 


upon 
completing 
the current 
bus cycle. The 
8086 acknowledges 
that 
it has entered 
the 


HOLD 
state 
by outputting 
HLDA 
high. 


RQ/GT1-IHlDAI. If the MN/MX pin is grounded, 
this pin functions as RQ/GTI. 


RQ/GTI 
is functionally 
identical to RQ/GTO, except that RQ/GTI 
has lower priority 
than RQ/GTO. If an RQ/GTO request/grant 
sequence is not in progress, then the 8086 
can begin an RQ/GTI 
request/grant 
sequence. 
Request grant sequences are covered in 
detail in Chapter 
8. If the MN/MX 
pin level is high, RQ/GTI-(HLDA) 
functions 
as 
HLDA. HLDA is the HOLD Acknowledge signal. HLDA is output high to acknowledge 
a Hold request made via HOLD. When the HLDA signal is raised high, the 8086 CPU 
also floats its three-state 
output signals. Thus it floats the system bus. 
lOCK-IWRI. If the MN/MX pin is grounded, 
this pin functions as the LOCK pin. 
LOCK is output low to prevent the 8086 from losing system bus control while executing 
an instruction. 
While LOCK is low, external hardware should guarantee that other bus 
masters do not gain control of the system bus. When the 8086 executes a LOCK instruc- 
tion the LOCK signal is output low for the duration of the next instruction's 
execution. 
If the MN/MX 
pin level is high, LOCK-(WR) 
functions as the WR pin. WR is pulsed 
low during a memory or I/O write. The trailing edge of the pulse occurs while the data 
being output is stable on the address/data 
bus. 


ClK is the clock signal used to synchronize 
all 8086 logic. This signal is typically 
output by the 8284 Clock Generator. 


VCC is the power supply pin. The 8086 requires +5 V ± 10% be present on this 


Three-state 
lines 
and Signals 


I 


ADO-AD15 
A16/53 
A17/54 
A 18/55 
A19/56 
8HE/57 
RD 
50-(DEN! 
51-(DT/RI 
52-(M/iQl 
LOCK-(WR! 
INTA 


All of these signals are in the high impedance state while the 8086 is in a HOLD 
tate. The SO-(DEN), SI -(DT IR:), and S2- (MilO) signals are floated just prior to 8086 
ssuing a hold acknowledge. 
During an interrupt 
acknowledge, 
the ADO-AD15, 
A16/S3, 
A17/S4, 
A18/S5, 


~ 19/56 lines are floated. 


8086 
OVERVIEW 
AND 
BASIC SYSTEM 
CONCEPTS 


This section 
discusses 
basic 8086 system 
concepts, 
including 
bus cycles, the 
address/data 
bus, the system data bus, the Execution Unit, the Bus Interface Unit, and 
the instruction 
queue. 


The 8086 communicates 
with external logic via its system bus. The 8086 executes 
"bus cycles" to transfer data or fetch instructions. 
A "bus cycle" is shown in Figure 7-2. 
The minimum bus cycle consists offour CPU clock periods called T States. During 
the first T state (TO, the 8086 outputs an address on the 20-bit multiplexed 
address/ 
data/status 
bus. The address on the bus is considered valid when the ALE signal makes 
the transition 
from high to low. In a minimum 
system, this signal is produced by the 
8086. In a maximum system, this sgnal is produced by the 8288. Bus Controller. The S2- 
(MilO) 
signal indicates whether a memory or an I/O access is being performed. 


During the second T State (T2), the 8086 removes the address from the address 
bus. For a read bus cycle the data bus lines are floated in preparation for a read cycle. For 
a write bus cycle data is output on the data bus lines. 
Data bus transceivers 
are enabled 
in either TI or T2, depending 
on the 8086 
system configuration, 
and the direction of the transfer 
(into or out of the 8086). Read, 


write, or interrupt 
acknowledge control signals are always enabled in T2. 


During T2, bus cycle status (S3, S4, S5, S6) is output on the upper four address/ 
status bus lines. The status information 
below is available for the rest of the bus cycle. 


54 
53 


0 
0 
Extra (Relative to the ES segment) 
0 
1 
Stack (Relative to the SS segmentl 


1 
0 
Code/None (Relative to the CS segment 
or a default zero) 


1 
1 
Data (Relative to the DS segment) 


S5 = IF (Interrupt enable flag) 
v 


S6 = 0 (Indicates the 8086 
is on the bus) 


The 8086 continues 
to provide status information 
on the upper four address/ 
status bus lines during T3. Also, the 8086 continues 
to output data during a write bus 
cycle. For a read bus cycle, the 8086 samples input data at the end of T3. If the selected 
device is not capable of transferring 
data at the required transfer rate, the device must 


signal "not ready" by inputting READY low. This causes the 8086 to insert additional 
clock cycles after T3. These additional clock cycles are designated 
as Tw states 
(wait 
states). The "not ready" indication must be presented to the CPU before the start ofT3. 
Bus activity during Tw is the same as T3. When the selected device has had sufficient 
time to complete the transfer, 
it raises READY high. After Tw clock periods end, T4, 
the last clock period of the bus cycle, is executed. 


During T4, memory and I/O control lines are disabled and the selected external 
device disconnects 
itself from the system bus. 


A bus cycle appears 
to devices 
on the system 
bus as an asynchronous 
event, 
con- 
sisting 
of an address 
to select 
the device, 
and a register 
or memory 
location 
within 
the 


device, 
plus a read strobe, 
or a write strobe 
with accompanying 
data. The selected 
device 
accepts 
bus data during 
a write cycle; the selected 
device 
must 
place data on the bus dur- 
ing a read cycle. At the end of the bus cycle, the device 
latches 
written 
data or disables 
its 
bus drivers. 
The only way a device 
can modify 
a bus cycle is by inserting 
wait state 
clock 
periods 
via the READY 
control 
input. 


The 
8086 only executes 
a bus cycle when 
instruction 
fetches 
are being 
performed 
or when 
operands 
must 
be transferred 
between 
the 
8086 and 
memory 
or I/O devices. 


When 
not 
executing 
a bus cycle, 
8086 
bus 
interface 
logic executes 
idle clock 
periods 
(labeled 
TO. 
During 
idle clock periods 
the 8086 continues 
to output 
status 
information 
from 
the 
previous 
bus cycle on the four 
upper 
address 
lines. 
If the previous 
bus cycle 
was a write, 
the CPU continues 
to output 
data onto the 16 data bus lines until the start of 
the 
next 
bus 
cycle. 
If the 
8086 executes 
idle clock 
periods 
following 
a read 
cycle, 
the 
8086 floats 
the 
16 data 
bus lines 
until 
the start 
of the next 
bus cycle. 


When 
accessing 
memory, 
the 8086 performs 
two types 
of operations: 


An instruction 
fetch 


A memory 
access 
to read 
or write 
operands 
required 
by the instruction 


The 
normally 
simple 
sequential 
relationship 
between 
instruction 
fetch 
and 
memory 
access 
bus cycles is modified 
in the 8086 by the presence 
of a 6-byte 
instruction 
object 
code 
queue. 


If the 8086 bus interface 
logic would 
otherwise 
be idle, instead 
it executes 
instruc- 
tion fetch 
bus cycles during 
which 
it fetches 
the next 
sequential 
object 
code bytes 
from 
program 
memory, 
until 
the 
instruction 
queue 
is full. 
If more 
than 
one 
instruction's 
object code is in the queue, 
then an instruction 
fetch bus cycle may be separated 
from its 
operand 
memory 
access 
bus cycles 
by additional 
instruction 
fetch 
bus cycles. 
When 
a jump 
or call instruction 
is executed, 
the 
next 
sequential 
instruction's 
object 
code bytes, 
currently 
in the instruction 
queue, 
are no longer 
needed. 
Queue 
con- 
tents 
are therefore 
discarded 
with no adverse 
effects. 


Since 
most 
memory 
devices 
and 
peripherals 
that 
interface 
to the 
8086 require 
a 
stable 
address 
for an entire 
bus cycle, 
the address 
present 
on the multiplexed 
address/ 


data bus during 
Tl should 
be latched. 
This latched 
address 
is used 
to select 
the desired 
peripheral 
or memory 
location. 
To demultiplex 
the address/data 
bus, the 8086 provides 
an Address 
Latch 
Enable 
signal 
(ALE), 
which can be used to capture 
the address 
either 
in 8282 or in 8283 8-bit 
bistable 
latches. 


These 
latches 
are either 
inverting 
(8283) 
or non-inverting 
(8282) 
and have 
out- 
puts driven 
by tristate 
buffers 
that supply 
32 mA drive capability 
and can switch 
a 300 pF 
capacitive 
load 
in 22 ns 
(inverting) 
or 30 ns 
(non-inverting). 
The 
8282/8283 
latches 
propagate 
the address 
through 
to the outputs 
while ALE is high and latch the address 
on 
the following 
edge of ALE. 
But this delays address 
access and chip select decoding 
by the 
propagation 
delay 
of the latch. 


ALE 
STB 
OE 


8282/83 


A19-A15/ 
DI 
DO 
A19-A16 
56-53 


8086 


STB 
OE 


8282/83 


DI 
DO 
A15-A8 
Address 
Bus 


OE 
STB 


8282/83 


DI 
DO 
A7-AO 
-- 


AD15-ADO 


The latch outputs are enabled by the low active OE input. Demultiplexing 
of the 
multiplexed address/data 
bus (Iatchings of the address from the multiplexed bus) can be 
handled 
locally at appropriate 
points in the system 
or at the CPU, with a separate 
address bus distributing 
the address throughout 
the system. 


For optimum 
system 
performance 
and compatibility 
with multiprocessor 
and 
Multibus 
configurations, 
local demultiplexing, 
illustrated 
in Figure 7-4, is strongly 
recommended 
over 
distributed 
demultiplexing, 
as illustrated 
in Figure 
7-5. The 
remainder of this chapter will assume the bus is demultiplexed 
at the CPU, as illustrated 
in Figure 7-4. 
The 8086 memory address space can be viewed as a sequence of one million bytes 
in which any byte may contain an 8-bit data element and any two consecutive bytes may 
contain a 16-bit data element. 
There is no constraint 
on byte or word address bound- 
aries. The address space is physically connected 
to a 16-bit data bus by dividing the 
address space into two 8-bit banks of up to 512K bytes each. 


FFFFF 


FFFFE 


FFFFD 


FFFFC 


~ 


5 


4 


3 


2 


1 


0 


One bank is connected to the lower half of the 16-bit data bus (07-0) and contains 
even address bytes (AO= 0). The other bank is connected 
to the upper half of the data 
bus (015-8) 
and contains odd address bytes (AO= 1). A specific byte within each bank 
is selected by address lines A 19-A 1. Data bytes are transferred 
to even addresses over 
the lower half of the data bus (07-0). 


AO, when low, selects the memory bank connected 
to the lower half of the data 


bus; Bus High Enable (BHE) is output high to disable the memory bank on the upper 
half of the data bus. This disabling process is necessary to prevent a write operation to 
the lower memory bank from destroying data in the upper bank. Since BHE is a multi- 
plexed signal, with timing identical to the A 19-A 16 address lines, it also must be latched 
by ALE to provide a stable signal for the entire bus cycle. 


During T2 through T4, the SHE output is available on status line S7. (The mean- 
ing of status line S7 has not yet been defined.) 


When accessing memory 
bytes with odd addresses, 
information 
is transferred 
over the upper half of the data bus (015-08). 
SHE is output low to enable the upper 
memory bank. AO is output high to disable the lower memory bank. This may be illus- 
trated as follows: 


The 8086 transfers data via the correct half of the data bus and outputs SHE and 
AO with the required signal levels. 
As an example, consider loading a byte of data into the CL register (lower half of 
the CX register) from a memory location with an odd address. This data will be accessed 
via the upper half of the 16-bit data bus. Although this data is transferred 
into the 8086 
over the upper eight data bus lines, the 8086 automatically 
redirects the data to the 
lower half of its internal 
16-bit data path and hence to the CL register. This capability 
allows byte 1/0 transfers 
via the AL register to access 1/0 devices connected to either 
the upper or lower half of the 16-bit data bus. 
16-bit words that are located at even addresses 
(two consecutive 
bytes with the 
least significant byte at an even byte address) are accessed in a single bus cycle. A 19-A 1 
select the appropriate 
byte within each bank; AO low and SHE low enable both banks 
simultaneously. 


16-bit words located at odd addresses (two consecutive bytes with the least signifi- 
cant byte at an odd byte address) are accessed using two bus cycles. During the first bus 
cycle the lower byte (with the odd byte address) 
is accessed. During the second bus 
cycle, the upper byte (with the even byte address) is accessed. During the first bus cycle, 
A19-Al 
specifies the address. AO is 1 (for an odd address) and BHE is low. Therefore 
the lower memory bank is disabled and the upper memory bank is enabled. During the 
second bus cycle, the address is incremented. 
Therefore 
AO is O.SHE is high, however, 


so the lower memory bank is enabled, and the upper memory bank is disabled. This may 
be illustrated as follows: 


The sequence illustrated above is automatically executed by the 8086 whenever a 
word transfer specifies an odd address. The 8086 automatically connects the upper and 
lower bytes of the 8086's internal16-bit 
registers with the appropriate halves of the data 
bus. Note, however, that accessing a word on an odd address boundary requires an extra 
bus cycle; this degrades system performance. 


During a byte read, the CPU floats the entire 16-bit data bus during clock period 
T2, even though data is expected on the upper or lower half of the data bus but not on 
both halves. This simplifies the chip select decoding requirements 
for read only devices 
(ROM, EPROM). 
(We describe chip select logic later,) During a byte write operation, 
the 8086 will drive the entire 16-bit data bus. The information 
on the half of the data bus 
not transferring 
data is indeterminate. 
These concepts also apply to the I/O address 
space. 


When referring to the system data bus, two implementation 
alternatives 
must be 
considered; 
(a) a multiplexed address/data 
bus, as illustrated in Figure 7-6, or (b) a data 
bus buffered from the multiplexed 
bus by transceivers, 
as illustrated in Figure 7-7. 
When using the multiplexed data bus, a designer must guarantee that memory or 
I/O devices that are connected directly to the multiplexed bus do not corrupt the address 
on the bus during Tl. 
To avoid this situation, 
device output 
drivers should not be 
enabled by the device chip select but should have an output enable controlled 
by the 
system read signal, as illustrated in Figure 7-8. 


8282 
DO 


A15-AB 


8282 
DO 


A7-AO 


8282 
DO 


Multiplexed 
Address/Data 


8282 
A19-A16 
DO 


8282 
DO 


DI 


A7-AO 


8282 
DO 


DI 


T 
B D7-DO 


8286/8287 


A 


STB 


8282 D 


8086 timing guarantees 
that read is not valid until after the address is latched by 
ALE and the multiplexed 
address/data 
bus is floated. 


ALE-.1 


(All Intel Peripherals, 
EPROM Products 
and RAMs, 
if they are intended 
for 
microprocessors, 
provide output 
enable or read inputs that let them connect 
to the 
multiplexed 
bus in the manner 
illustrated in Figure 7-8.) 


Several techniques 
exist for interfacing devices to the multiplexed 
bus when they 
do not have output enables. But each introduces restrictions or limitations if the device 
chip select is externally gated with a command 
as follows: 


RD or WR -----~ 
CS -----e:t.-I">----- 
CS to device 


Consider Figure 7-8, which has chip select gated with read and write. If external 
gating is used, as illustrated 
above, 
this presents 
two problems. 
First the chip select 
access time is reduced to the read access time, which may force the use of an otherwise 
unnecessarily 
fast support 
device, assuming 
that maximum 
system performance 
(no 
wait states) is to be achieved. This is illustrated in Figure 7-9. 
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I,. 
CD 


CD Access time for CS generated from address decode 
o Access time if CS is gated with RD/WR 


In Figure 7-8, for a device which provides separate CS and RD/OE 
inputs, the 
access actually starts from CS internally; 
but the output drivers onto the bus are not 
enabled until RD/OE. Thus the access time is not from RD/OE. This may be illustrated 
as follows: 


CS 


Internal 
)I 
Register 
•. 


RD/OE 


Data Bus 
Connection 


The designer must also verify that chip select to write set-up and hold times for 
the device are not violated, when chip select is gated externally. 
This is illustrated 
in 
Figure 7-10. 
Alternative 
device select techniques 
are available, 
but they also impose special 


estrictions. 
It is therefore 
recommended 
that you connect devices having output ena- 


bles to a multiplexed 
data bus. 
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CD CS is not valid prior to write and becomes active one or two gate delays later 
(3) 
CS remains valid after write one or two gate delays 


Another 
limitation 
on the multiplexed 
data bus illustrated 
in Figure 7-6 is the 
8086's drive capability of 2.0 mA, and its capacitive loading of 100 pf, to guarantee 
the 
specified AC characteristics. 
Assuming capacitive loads of 20 pf per I/O device, 12 pf per 
address latch and 5-12 pf per memory device, a system mix of three peripherals and two 
to four memory devices (per bus line) is very close to the loading limit. 


To satisfy the capacitive loading and drive requirements 
of larger systems, 
the 
data bus must be buffered as illustrated 
in Figure 7-11. The 8286 non-inverting 
and 
8287 inverting octal transceivers 
are offered as part of the 8086 family to satisfy this 
requirement. 
They have tristate output buffers that drive 32 mA on the bus interface 
and 10 mA on the component 
interface; also, they can switch capacitive loads of 300 pf 
at the bus interface and 100 pf on the component 
interface in 22 ns (8287) or 30 ns 
(8286). The 8086 system provides 
Data ENable 
(DEN) and Data Transmit/Receive 
(DT fR) signals to enable the 8286 and 8287 transceivers 
and to control their direction, 


as illustrated in Figure 7-12. 
These signals provide the appropriate 
timing to guarantee 
isolation of the multi- 
plexed bus from the system during T1 and to eliminate 
bus contention 
with the CPU 
during read and write bus cycles. 


Although the memory and peripheral devices are isolated from the CPU, bus con- 
tention may still exist in the system if the memory/peripheral 
devices do not have an 
output enable control in addition to the chip select. This configuration 
is illustrated in 
Figure 7-13. As an example, 
bus contention 
may exist during the transition 
from one 
chip select to another; 
the newly selected device may begin driving the bus before the 
previously selected device disables its drivers. A more severe problem can occur during 
a write bus cycle. A device whose outputs are controlled only by chip select will drive the 
bus from chip select to write active, simultaneously 
with data output 
being written 
through the transceivers 
by the CPU. This condition is illustrated in Figure 7-14. 
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CD 
DEN is enabled after the 8086 
has floated the multiplexed bus. 
@ DEN enables the transceivers early in the cycle. but DT/R guarantees the tranceivers are in transmit 
rather than receive mode and will not drive against the CPU. 
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Figure 7-14. 
Bus Contention 
on the System Bus During Write for 


Devices Without Output Enables 
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The same 
technique 
given for circumventing 
select tImmg problems 
on the 
multiplexed 
bus can be applied here but with the same limitations. 
A second level of buffering can reduce the total load seen by devices on the 
ystem bus. This is illustrated in Figure 7-15. 


Typically, double 
buffering 
is used in multi board systems 
to isolate memory 
arrays. 
Double 
buffering, 
however, 
introduces 
additional 
access delays, 
and more 
important, 
you must pay attention to control of the second transceiver in relationship to 
the system bus and the device being interfaced to the system bus. Several techniques for 
controlling the second transceiver 
are available. 
This first technique, 
shown in Figure 7-16, simply distributes 
DEN and DT/R: 
throughout 
the system. DT/R: is inverted to provide proper direction control for the sec- 
ond level transceivers. 


The second technique, 
shown in Figure 7-17, provides control for devices with 
output enables. 
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Figure 7-18. 
Devices without OE. Common 
or Separate Input/Output 
Limited 


Read Access. Limited CS to WE Hold and Set-Up 


RD is normally used to direct data from the peripheral 
to the system bus. In 
Figure 7-17 the buffer is selected whenever 
a device on the local bus is selected. Bus 


contention 
is possible on the device's local bus during a read, as the read simultaneously 


enables the device output and changes the transceiver 
direction. The contention 
may 


also occur while the read is terminating. 


For devices without output enables, the technique 
illustrated in Figure 7-17 can 
be applied if the chip select to the device is conditioned 
by read or write. This is illus- 
trated in Figure 7-18. 


Controlling 
the chip select with read/write 
prevents 
the device from driving 
against the second transceiver 
prior to the command 
being received. The limitations of 
this technique 
are: 


1. 
Access limited to read/write 
time, as discussed previously, and 
2. 
Chip select limited to write set-up and hold times. 


An alternate 
technique 
applicable to devices with and without output enables is 
shown in Figure 7-19. 


RD again controls the direction of the second transceiver, 
but it is not enabled 
until a command and chip select are active. The possibility for bus contention 
still exists 
but is reduced 
to variations 
in output 
enable 
versus 
direction 
change time for the 
transceiver. 
Full access time from chip select is now available. However, data will not be 
valid prior to write and will only be held valid after write by the amount of delay required 
to disable a transceiver. 


One last technique 
is given for devices with separate 
inputs and outputs. 
See 
Figure 7-20. 


Figure 7-19. 
Devices without DE. Common 
or Separate Input/Output 


Full Read Access. Limited Write Data Set-Up and Hold 


Separate bus receivers and drivers are provided, rather than the single transceiver 
illustrated thus far. The receiver is always enabled, while the bus driver is controlled by 
RD and chip select. The only possibility for bus contention 
in this system occurs when 
multiple devices on each line of the read bus are enabled and disabled during chip selec- 
tion changes. 


Throughout 
this section on interfacing the 8086, the multiplexed bus will be con- 
sidered the "local" 
CPU bus, and the demultiplex address and buffered data bus will be 
the "system 
bus." 


The most important 
concept to understand 
when looking at 8086 instruction 
execution timing is the fact that 8086 bus control logic has been separated from the 8086 
instruction 
execution logic. That is to say, the 8086 has an Execution Unit (EU) and a 
Bus Interface Unit (BIU). 
The Execution Unit (EU) contains data and address registers, the Arithmetic and 
Logic Unit, plus the Control Unit. The Bus Interface Unit (BIU) contains bus interface 
logic, segment registers, memory addressing logic, and a six-byte instruction object code 
queue. This may be illustrated as follows: 
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The 
Execution 
Unit 
(EU) 
and 
the 
Bus 
Interface 
Unit 
(BIU) 
operate 
asynchronously. 
Whenever the Execution Unit is ready to execute a new instruction, 
it 
fetches the instruction 
object code from the front of the Bus Interface Unit instruction 


queue, 
then it executes 
the instruction 
in some number 
of clock periods that have 
nothing to do with bus cycles. If the instruction object code queue is empty, then the Bus 
Interface Unit (BIU) executes an instruction fetch machine cycle - 
and the CPU waits 
for the instruction 
object code to be fetched. But the queue will rarely be empty, for 
reasons that will soon become apparent. Therefore, 
the EU will usually not have to wait 
while an instruction 
fetch is executed. 
If memory 
or an I/O device must be accessed in the course of executing 
an 
instruction, 
then the EU informs the BIU of its needs. The BIU executes an appropriate 
external access machine cycle in response to the EU demand. 


The Bus Interface Unit (BIU), for its part, is independent 
of the Execution Unit 


(EU) and attempts to keep the six-byte queue filled with instruction object codes. If two 
or more of these six bytes are empty, then the Bus Interface 
Unit (BIU) executes 
instruction fetch machine cycles - 
provided the EU does not have an active request for 
bus access pending. If the EU issues a request for bus access while the BIU is in the mid- 
dle of an instruction 
fetch machine cycle, then the BIU will complete the instruction 
fetch machine cycle before honoring the EU bus access request. 


Consider 
what happens 
when an instruction 
is executed. 
Beginning with the 
simplest case, the instruction 
object code queue within the Bus Interface Unit will be 
empty. Therefore, 
when the EU requests an instruction, 
the BIU will execute a bus cycle 
to fetch the first byte of the instruction. 
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Let us assume that this particular instruction 
requires two bytes of object code. 
Keeping things simple, we will illustrate another 
bus cycle executed 
immediately 
to 
fetch the next instruction 
byte: 
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Let us suppose that this instruction reads a word of data from memory, then per- 
forms an arithmetic 
operation using this data. The instruction 
is going to require some 
number of clock periods to compute the effective address for the data memory location 
to be accessed (we will assume seven clock periods are needed). 
Some additional num- 
ber of clock periods will also be needed to perform the arithmetic 
operation 
(we will 
assume 
nine clock periods). 
In a normal 
microprocessor, 
this instruction 
might be 
executed as the following sequence of machine cycles: 
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But the 8086, having asynchronous 
CPU and Bus Control 
Unit logic, will use 
clock periods to execute the instruction 
i1lustrated above as follows: 
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second byte of 
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one bus cycle 


The EU computes 
a data memory 


address in 7 clock petiods. 
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end of the 7th clock per'lad the 


CPU requests bus access 
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data to be 
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access. the BIU fetches the next two 
from memory 
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Now, the illustration 
above is not accurate 
because, 
you will recall, the 8086 
fetches data in I6-bit increments, 
provided the data address lies on an even-byte bound- 
ary. Also, the BlU fetches instruction 
bytes and loads them into the queue only when 
there are at least two free bytes in the queue. Let us assume that all data does lie on 
even-byte 
boundaries. 
This is how our timing will now look: 
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object code byte. 
There 
are 
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exe- 
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The 
EU cO:Tlputes 
a data 
memory 
address 
in 7 clock 
periods. 
At 
the 
end of 
the 7th clock 
period 
the 
EU requests 
bus access. 


The 
E U wails 
for 
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requested data 10be 
fetched 
by 
the BCU. 


BIU 
fetches 
two 


bytes of object 
code 
in one bus 
cycle. 
The CPU 
takes both 
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queue 
IS Imme· 
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BIU 
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bus cycles 
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two 
more 
bytes 
of object 
code 
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stores 
them 
In the 


Queue which 
is 


now 
full. 


EU 
The 
EU ends instructIon 
execution 
and 
fetches 
one byte 
of object 
code 
from 
Queue to execute 
next 
instruction. 


There are some important 
points to note regarding 8086 bus cycle timing. 
Bus cycles are a Bus Interface Unit (BIU) phenomenon. 
So far as the EU logic is concerned, 
bus cycles do not exist. The EU experiences 
periods of activity while executing instructions and periods of inactivity while waiting for 
instruction object codes or data that the BIU must process via bus cycles. Periods of EU 
activity are timed by a sequence 
of clock periods. The EU makes no attempt to group 


clock periods into machine cycles, nor do EU clock periods have to occur in any special 
numeric combinations. 
So far as the BIU is concerned, 
clock periods are grouped into bus cycles only 
when data must be transferred 
to or from the 8086. First priority is given to a bus access 
request coming from the EU. If the EU is not requesting 
bus access, then the BIU 
executes instruction 
fetch bus cycles until the queue is full. These are the prerequisites 
for the BIU to execute an instruction 
fetch bus cycle: 


1. 
The clock period that initiates the bus cycle would otherwise be an idle clock 
period. 


2. 
The EU does not have an active bus access request pending. 
3. 
There are at least two bytes empty in the queue. 


If the queue is full, then the BIU ceases to execute bus cycles; as illustrated above, 
a sequence of idle clock periods occurs. 
Note that the CPU may have to wait for bus access. In the illustrations above, the 
CPU requires seven clock periods in order to compute a data memory address. At the 
end of the seventh clock period, the EU issues a bus access request to the BIU. But at 
this time the BIU is part way through executing an instruction 
fetch bus cycle. The BIU 
completes the instruction 
fetch bus cycle, then honors the EU bus access request. 


In the final illustration 
above, no bus cycle accompanies 
the beginning of a new 
instruction's 
execution. 
We are assuming that the next instruction 
executed 
has one 
byte of object code. This object code byte is fetched from the front of the queue - which 
then has just one empty byte. No bus cycle is executed to fetch the instruction 
object 
code, since it is taken out of the queue. Subsequently, 
the BIU does not execute an 
instruction fetch bus cycle since there is only one empty byte. There must be at least two 
empty bytes in the queue before the BIU will execute an instruction 
fetch bus cycle. 


Based on the foregoing discussion of 8086 instruction 
fetch queuing, we can see 
that the 8086 has essentially eliminated instruction fetch time. The only time the EU will 
have to wait while the BIU fetches instruction 
object codes is when a Branch-on-Condi- 
tion instruction 
causes execution to branch out of the queue sequence or when (for any 
reason) the memory accesses accompanying an instruction's 
execution are so dense that 
the BIU has insufficient idle clock periods within which to insert instruction 
fetch bus 
cycles. 


Basic 8086 
Design Single CPU 


The 8086 is easily configured to operate in a variety of markedly different applica- 
tions. The MN/MX input, which we described in Chapter 7, is a strapping option that 
allows the 8086 to function with two different sets of outputs, 
identified as "minimum 
mode" 
and "maximum 
mode". 
We wiII now examine these two modes in more detail. 


The minimum 
mode 8086 has the MN/MX 
pin connected 
to Vcc. Minimum 
mode should be used in one or two board single CPU systems. Figure 8-1 illustrates a 
minimum 
mode 8086. 


In minimum 
mode the 8086 addresses 
a full megabyte memory space and 64K 
byte I/O space. The data bus is 16 bits wide. The 8086 directly provides bus controls 
(DT/R, 
DEN, ALE, MilO, RD, WR, INTA). A simple CPU preemption 
mechanism, 
compatible with existing DMA controllers, 
is enabled via the HOLD and HLDA signals. 
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The maximum mode, illustrated in Figure 8-2, has the MN/MX 
pin connected to 
ground. Maximum 
mode is used in multiprocessor 
and co-processor 
configurations. 


In maximum 
mode, an 8288 Bus Controller 
receives control signals, as inputs, 


from the 8086. These inputs are decoded by the 8288 to generate Control output signals. 
Other 8086 Control outputs are also modified to provide external logic with more infor- 
ration. 
Specifically, 8086 output signals are redefined as follows: 


1. 
Queue status is output at QSO and QS1:This allows external devices, e.g., an 
ICE/86 or special instruction 
set extension 
co-processors, 
to track the CPU 
instruction 
execution. 


2. 
System control 
and configuration 
options 
are expanded 
via the bus cycle 
status outputs SO, S1, and S2. These outputs are used by the 8288 bus con- 
troller, 8289 bus arbiter, and similar external devices. 


3. 
Access control to shared resources in multiprocessor 
systems is supported 
by 
a bus lock mechanism. 


4. 
Two prioritized 
levels of processor 
preemption 
(RQ/GTO, 
RQ/GTl) 
allow 
multiple processors 
to reside on the 8086's local bus sharing a system bus 
interface. 


We will now examine how these expanded capabilities may be used. 
The queue status indicates what information 
is being removed from the internal 
queue and when the queue is being reset due to a transfer of control. Table 8-1 summar- 
izes Queue Status interpretation. 
Using logic akin to that illustrated in Figure 8-3, you can track 8086 queue status. 


When SO, SI, and S2 are 1, 0, and 0, respectively, 
an instruction 
fetch is being per- 
formed. QSO and QSl indicate whether the instruction 
is being fetched from the 8086 
queue or from external memory. For an external memory access, AOand BHE indicate 
a word or byte access. This logic can be used in a number of ways; consider the following 
examples. 


The ICE/86 can track execution of an instruction stored in a specific memory loca- 
tion. Figure 8-3 gives an example of a circuit used by the ICE/86 to track the queue. 


Table 8-1. Queue Status Outputs 


aS1 
aso 
Interpretation 


0 
0 
No Operation 


0 
1 
First Byte of Op-code from Queue 


1 
0 
Empty the Queue 
1 
1 
Subsequent Byte from Queue 


The queue status is valid during the elK cycle after which 
the queue operation is performed. 
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The first up-down counter tracks the depth of the queue, while a second up-down 
counter 
captures 
the queue depth 
on a match. The second counter 
decrements 
on 
further fetches from the queue until the queue is flushed, 
or the count goes to zero, 
indicating execution of the match address. The first counter decrements 
on a fetch from 
the queue (QSO= 1) and increments 
when an object code byte is stored into the queue. 


Note that a normal instruction 
fetch from external memory will transfer two bytes into 
the queue so two clock increments 
are given to the counter 
(T20l and T30l). 
When a 
single byte is loaded over the upper half of the bus (AO is high and BHE is low), the 
counter is incremented 
once. Since the EU is not synchronized 
to the BlU, a fetch from 
the queue can occur simultaneously 
with a transfer into the queue. The exclusive-OR 
gate driving the ENP input of the first counter allows these simultaneous 
operations to 
cancel each other and not modify the queue depth. 


The 8086 queue might be tracked by a co-processor 
to detect execution 
of an 
ESCAPE instruction. 
The ESCAPE instruction 
will direct the co-processor 
to perform 
some specific task. 
Table 8-2 defines interpretations 
of status lines SO, Sl, and S2. These status lines, 
you will recall, tell the 8288 when to initiate a bus cycle, what type of command to issue, 
and when to terminate the bus cycle. The 8288 samples the status lines at the beginning 
of each CPU clock period. At the start of a bus cycle, the CPU drives the status lines 
from the passive state (the SO, Sl, S2 lines are all high) to one of the seven possible 
active states. 


For each new bus cycle the 8086 will alter the state of SO,Sl, and S2 on the rising 
edge of the T4 clock during the previous bus cycle, or during a TI idle cycle, if there is no 
current bus activity. The 8288 detects a status change by sampling the status lines on the 
high-to-low transition 
of each clock period. The 8288 starts a bus cycle by generating 
a 
high ALE pulse, accompanied by appropriate buffer direction controls; this occurs dur- 
ing the clock period immediately 
following detection 
of the status change. The bus 
transceivers 
and the selected operation are enabled during the next clock period. When 
the status returns to the passive state, the 8288 will terminate 
the operation. Timing is 
illustrated in Figure 8-4. 
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The 8086 maintains SO,S1, and S2levels during Wait states. A Wait state, you will 
recall, is induced by external logic inputting a low RDY signal to the 8284 Clock Genera- 
tor. The Clock Generator 
outputs a high READY signal, which is synchronized 
with 
CLK and transmitted 
to the 8086. Since the 8086 maintains SO, Sl and S2 levels during 
Wait states, the 8288 will maintain active bus control for a Wait state extending over any 
number of clock periods. The status lines may also be used by any other processors on 
the 8086's local bus to monitor 
bus activity and control the 8288 if those other pro- 
cessors gain control of the local bus. 
The 8288 provides bus control signals DEN, 
DT/R, 
ALE, and control signals 
INT A, MRDC, 10RC, MWTC, AMWC, 10WC, AIOWC. The control signals separate 
read and write operations 
for memory 
and I/O, 
to be compatible 
with the 
Intel 
MULTIBUS. 


The advanced write control signals are enabled one clock period earlier than nor- 
mal write control signals to accommodate 
the wider write pulse widths often required by 
peripherals 
and static RAMs. The normal write control signal allows data to be set in 
advance of the write pulse, to accommodate 
dynamic RAM memories and I/O devices 
that strobe data on the leading edge of the write control pulse. The advanced write con- 
trol signals do not guarantee that data is valid prior to the leading edge of the control sig- 
nal. 
. 


The DEN signal in the maximum mode is inverted, as compared to the minimum 
mode. This makes it easier to logically gate DEN with other signals, particularly inter- 
rupt controls. Figure 8-5 compares the timing of the minimum and maximum mode bus 
transfer control signals. 
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Maximum 
mode is designed 
for multiprocessing 
configurations, 
and for large 
single CPU designs (either Multibus systems or systems that contain two or more PC 
boards). The 8288 is a bipolar device; therefore, 
its 32 ma output drive for the Control 
signals and tolerances on timing parameters 
and worse case delays provide better large 
system performance 
than the minimum 
mode 8086. 
In addition to assuming the functions removed from the 8086, the 8288 provides 
additional strapping options and controls to support multiprocessor 
configurations 
and 
peripheral devices on an 8086 local bus. These capabilities allow resources, 
including 
memory or 1/0, to be assigned as either shared or local. Shared resources are available 
on the Multibus system bus. Local resources are accessible only by this 8086 on its local 
bus. This technique 
reduces contention 
for access to the Multibus 
system bus, and 
improves 
multi-CPU 
system performance. 
Specific configurations 
are described 
in a 
later chapter. 


The 8086 maximum mode LOCK output helps control access to shared resources. 
The LOCK output is activated when the 8086 executes the LOCK prefix instruction. 
The LOCK output 
goes low during the first clock period following execution 
of the 
LOCK prefix; it remains low through the last instruction 
execution clock period for the 
instruction 
following the LOCK prefix, and during the first clock period of the next 
instruction's 
execution. 
The LOCK signal must 
be part of every microprocessor's 
system bus arbitration 
logic. 


During normal multiprocessor 
system operations, 
priority for shared system bus 
access is determined 
by arbitration 
circuitry on a cycle by cycle basis. When an 8086 
needs to transfer data via the system bus, it requests bus access. When the 8086 gains 
priority, as determined 
by any system bus arbitration scheme selected, it takes control of 


the system bus, executes its bus cycle, then either maintains system bus control, volun- 
tarily releases the system bus or is forced off the system bus by the loss of priority. The 
lock mechanism 
prevents an 8086 from losing system bus control, either voluntarily or 


involuntarily. 
This guarantees 
an 8086 the ability to execute multiple bus cycle instruc- 


tions without intervention 
and possible corruption 
of the data by another 
CPU. The 
activity of the LOCK output is shown in Figure 8-6. 


Note that the LOCK output will go inactive between separate locked instructions. 


Also, the LOCK prefix adds two clock periods to execution 
time. 


Since queue status reflects the queue operation in the previous clock period, the 


LOCK output actually goes active coincident with the start of the next (locked) instruc- 
tion and remains active for one clock period following the locked instruction's 
execu- 
tion. 


If the instruction 
following the LOCK prefix does not have its object code in the 


queue, the LOCK output goes low as shown while the instruction 
object code is being 
fetched from external memory. 


The Bus Interface 
Unit (BIU) will still perform instruction 
fetch cycles during 


execution 
of a locked instruction. 
The LOCK merely guarantees 
that one 8086 will 


maintain system bus control for the duration of an instruction's 
execution; 
it in no way 


restricts the type of bus activity that this CPU can perform during this locked time. 


Lock Prefix 


Byte from Queue 
NOP Byte 
Next Lock Prefix 


from the Queue 
from the Queue 
(Locked NOP) 


The lock mechanism 
is commonly 
used during a TEST and SET handshaking 
sequence. 
During this sequence 
an 8086 reads from a shared 
memory 
location and 
returns data to the location. No other CPU can be allowed to reference this memory 
location between the TEST, which is a read operation, 
and the SET, which is a write 
operation. The 8086 accomplishes this with a locked Exchange instruction, 
as follows: 


reg.memory 
;reg is any of the 8086 
;registers. memory is the 
;address of the semaphore 


Another interesting use of the LOCK in multiprocessor 
systems is a locked block 
move, which allows high speed message transfer from one CPU's message buffer to 
another. 
During the locked instruction, 
a request for processor preemption 
is recorded, 


occurring 
via a RQ/GT 
line but not acknowledged 
until completion 
of the locked 
instruction. 
The LOCK prefix does not have any direct effect on interrupts. 
In general, prefix 


bytes are considered 
extensions 
of the instructions 
they precede. Therefore, 
interrupts 


that occur during execution of a prefix are not acknowledged 
(assuming interrupts 
are 
enabled) 
until completion 
of the instruction 
following the prefixes. Note that multiple 
prefix bytes may precede an instruction; 
the repetition prefix (REP) is interruptible 
after 
each execution 
of the following instruction. 
This is true even if the REP is combined 
with the LOCK prefix, so that interrupts 
are not locked out during a block move, or 
other repeated string operation. Further information 
on the operation and string opera- 
tion with multiple prefixes is presented 
later in this chapter in the section dealing with 
the 8086 interrupt 
structure. 


Additional 
levels of prioritized 
processor 
preemption 
are discussed 
in greater 
detail later in this chapter. 


The 8086 requires a clock signal with fast rise and fall times (10 ns max) between 
ow and high voltages of -0.5 
to +0.6 
low and +3.9 
to VCC+l.O 
high. Maximum 
lock frequency of the 8086 is 5 MHz. Since the design of the 8086 incorporates dynamic 
ells, a minimum 
frequency 
of 2 MHz is required. 
Due to the minimum 
frequency 
equirement, 
single stepping or cycling of the CPU may not be accomplished 
by disab- 
ing the clock. Timing and voltage requirements 
of the CPU clock are illustrated 
in 
igure 8-7. 


I 
In general, for frequency below the maximum, 
the CPU clock need not satisfy the 


[requency dependent 
pulse width limitations stated in the 8086 data sheets. The values 
specified only reflect the minimum 
values that must be satisfied, and they are stated in 


Iterms of the maximum 
clock frequency. 
As the clock frequency approaches 
the max- 
imum frequency of the CPU, the clock must conform to a 33% duty cycle to satisfy the 
ICPU minimum 
clock low and high time specifications. 
An optimum 33% duty cycle clock with the required voltage levels and transition 
imes can be obtained with the 8284 clock generator, 
as illustrated in Figure 8-8. 


Either an external 
frequency 
source or a series resonant 
crystal may drive the 
8284. The selected source must oscillate at three times (3X) the desired CPU frequency. 


o select the crystal inputs of the 8284 as the frequency source for clock generation, 
the 
FIE input to the 8284 must be strapped to ground. The strapping option allows either 
he crystal or the external frequency input to be selected as the clock generator source. 
~Ithough 
the 8284 provides an input for a tank circuit to accommodate 
overtone mode 
crystals, fundamental 
mode crystals are recommended 
for more accurate 
(and stable) 


[requency generation. 
When selecting a crystal for use with the 8284, series resistance 
should be as low as possible. Since other circuit components 
will tend to shift the operat- 


~ngfrequency from resonance, 
the operating impedance will typically be higher than the 
~pecified series resistance. If the attenuation 
of the oscillator's 
feedback circuit reduces 


8284 
ClK 


F/C 


Figure 8-8. Using the 8284 to Provide eLK 


the loop gain to less than one, the oscillator will fail. Since the oscillator delays in the 
8284 appear as inductive elements to the crystal, causing it to run at a frequency below 
that of the pure series resonance, 
a capacitor should be placed in series with the crystal 
and the X2 input of the 8284. This capacitor cancels the inductive element. The value of 
the capacitor (C) must not cause the impedance 
of the feedback circuit to reduce the 
loop gain below one. The impedance of the capacitor is a function of the operating fre- 
quency and can be determined 
from the following equation: 


XCl = 
1 
27T 
• F • Cl 


It is recommended 
that the crystal series resistance, plus XCL, be kept less than lK 
ohms. This capacitor also serves to debias the crystal and prevent a DC voltage bias from 
straining 
and perhaps 
damaging 
the crystalline 
structure. 
As the crystal frequency 
increases, 
the amount 
of capacitance 
should be decreased. 
For example, 
a 12 MHz 
crystal may require C = 24 pf while 22 MHz may require C = 8 pf. If very close correla- 
tion with a pure series resonance is not necessary, a nominal C value of 12 to 15 pf may 
be used with a 15 MHz crystal (i.e., 5 MHz 8086 operation). 
Board layout and compo- 
nent variances 
will affect the actual amount 
of inductance, 
and therefore 
the series 
capacitance required to cancel it out. This is especially true for wire-wrapped layouts. 


Two of the many vendors who supply crystals for Intel microprocessors 
are listed 
in Table 8-3, along with crystal part numbers 
for various frequencies 
that may be of 
interest. 


For additional information 
on specifying crystals for Intel components 
see Intel 
application note AP-35. 
If a high accuracy frequency 
source, 
externally 
variable frequency 
source or a 
common source for driving 8284s is desired, the External Frequency Input (EFI) of the 
8284 can be selected by strapping the Fie input to +5 volts through 
lK ohms as illus- 
trated in Figure 8-9. 
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Crystek(11 
CT S Knight. (2) 
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Corp. 
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3.6 MHz 
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.. 
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5.185 MHz 
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CY8A 
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6.0 MHz 
P 
.. 
MP060 


6.144 
MHz 
P 
.. 
MP061 


6.25 MHz 
P 
.. 
MP062 


10.0 MHz 
P 
.. 
MP10A 


15.0 MHz 
S 
CY15A 
MP150 


18.432 
S 
CY198· 
MP184· 
24.0 MHz 
S 
.. 
MP240 


25.0 MHz 
S 
.. 
MP250 


27.0 MHz 
S (overtone) 
CY27A 
MP270 


·Intel also supplies a crystal numbered 8801 
for this applicaton. 


··Contact vendor with the appropriate specifications. 


Notes: 
1. Address: 
1000 Crystal Drive, Fort Meyers. Florida 33901 


2. Address: 
400 Reimann Ave .. Sandwich, Illinois 


The external frequency source should be TTL compatible, have a 50% duty cycle 
and oscillate at three times the desired CPU operation 
frequency. The maximum 
EFI 


frequency the 8284 can accept is slighty above 24 MHz, with minimum 
clock low and 


high times of 13 ns. Although no minimum EFI frequency is specified, it should not vio- 
late the CPU minimum 
clock rate. If a common frequency is used to drive 8284s dis- 


tributed 
throught 
the system, 
each 8284 should be driven by its own line from the 


source. To minimize noise in the system, each line should be a twisted pair, driven by a 
buffer such as the 74LS04, with the ground of the twisted pair connecting the grounds of 
the source and receiver. To minimize clock skew, the lines to all 8284s should be of 
equal length. A simple technique for generating a master frequency source for additional 
8284s is shown in Figure 8-10. 
In Figure 8-10 one 8284, with a crystal, is used to generate the desired frequency. 


The oscillator output of the 8284 (OSC) equals the crystal frequency, 
and is used to 


drive the external frequency to all other 8284s in the system. 


The oscillator output (OSC) is inverted, becoming the complement 
of the oscilla- 


tor signal used to drive the CPU clock generator circuit. Therefore the oscillator output 
of one 8284 should not drive the EFI input of a second 8284 if both are driving clock 
inputs of separate 
CPU s that are to be synchronized. 
The variation on EFI to CLK 


delay over a range of 8284s may approach 35 to 45 ns. If, however, all 8284s are of the 
same package type, have the same relative supply voltage and operate in the same tem- 
perature environment, 
the variation will be reduced to between 
15 and 25 ns. 


There are three frequency outputs in the 8284: the oscillator (OSC) mentioned 
above, the system clock (CLK) that drives the CPU, and a peripheral clock (PCLK) that 
runs at one half the CPU clock frequency. OSC is only driven by the crystal, and is not 
affected by the Fie strapping option. If a crystal is not connected to the 8284 when the 
external 
frequency 
input is used, 
OSC is indeterminate. 
CLK is derived 
from the 
selected frequency source by an internal divide by three counter. The counter generates 
the 33% duty cycle clock, which is optimum for the CPU at maximum frequency. PCLK 
has a 50% duty cycle and runs at one half the frequency of CLK. 
Since the state of the 8284 divide by three counter 
is indeterminate 
at system 
initialization 
(power on), an external synchronization 
signal to the counter (CSYNC) is 
provided to allow synchronization 
of the CPU clock to an external event. When CSYNC 
is brought high, the CLK and PCLK outputs are forced high. When CSYNC returns 
low, the next positive clock from the frequency source starts clock generation. 
CSYNC 


must be active for a minimum 
of two periods of the frequency 
source. If CSYNC is 
asynchronous 
with the frequency source, the circuit in Figure 8-11 should be used for 
synchronization. 


The two latches minimize the probability of a meta-stable state in the latch driving 
CSYNC. The latches are clocked with the inverse of the frequency source to guarantee 
the 8284 set up and hold time of CSYNC to the frequency source, as shown in Figure 8- 
12. 


If a single 8284 is to be synchronized 
to an external 
event, and an external fre- 


quency source is not used, the oscillator output of the 8284 may be used to synchronize 
CSYNC, as illustrated in Figure 8-13. 
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Since the oscillator output is inverted with respect to the internal oscillator signal, 
the inverter 
in the previous example is not required. 
If multiple 8284s are to be syn- 
chronized, 
an external frequency source must drive all 8284s, and a single CSYNC syn- 
chronization 
circuit must drive the CSYNC input of all 8284s as illustrated 
in Figure 
8-14. 
Since the 8086 minimum 
clock low time may not be met when CSYNC is acti- 
ated, it should be enabled only during a reset, or while the CPU clock is high. CSYNC 
ust also be disabled for a minimum 
of four clock periods before the end of reset to 
guarantee 
proper CPU reset. 
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Due to the fast transitions and high drive (5 mA) of the 8284 CLK output, it may 


be necessary to put a 100 ohm resistor in series with the clock line to eliminate ringing. If 
multiple sources of CLK are needed with minimum 
skew, CLK can be buffered by a 


high drive device 
(74S24l) 
with outputs 
tied to five volts through 
100 ohms 
to 


guarantee 
YCH=3.9 
min (8086 minimum 
input high voltage). 
A single 8284 should not be used to generate the CLK for multiple CPUs since the 
8284 synchronizes 
READY to the CPU and can only accommodate 
READY for a single 
CPU. 


74S241 


ClK 
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The 8086 requires an active high reset with minimum 
pulse width of four CPU 
clock periods, except after power on, which requires a 50 f.tS reset pulse. Since the CPU 
internal1y synchronizes 
reset with a clock, the reset is internal1y active for up to one 


clock period after the external 
reset goes low. A Non-Maskable 
Interrupt 
(NMI), 
a 
minimum 
mode hold request, 
or a maximum 
mode RQ pulse that occurs during the 
internal reset, will not be acknowledged. 
A minimum mode hold request or a maximum 
mode RQ pulse active immediately 
after the internal reset will be honored 
before the 
first instruction 
fetch. 
After the 8086 recognizes the reset, the CPU will condition the bus as shown in 
Table 8-4. 


Signals 
Condition 


ADO-AD15 
Tristate 


A 16-A 19/53-56 
Indeterminant 
SHE/57 
Indeterminant 
52/(M/fQl 


} 0",," " ..,..'"'" ",,,,,. 


51/(DT/R) 
50/(DEN) 
lOCK/WR 
AD 
INTA 
ALE 
0 
HlDA 
0 
RQ/GTO 
1 
RQ/GT1 
1 
Q50 
0 
Q51 
0 


Reset 
Input 


The multiplexed 
bus signal connections 
are floated by the CPU when it detects a 


reset. Other signals which can be floated are driven to the inactive state for one low state 
of CLK prior to entering tristate. This is illustrated in Figure 8-16. 


In minimum mode, ALE and HLDA are driven inactive but are not floated. In the 


maximum 
mode, RQ/GT lines are held inactive and the queue status outputs 
(QO and 
QI) indicate no activity. The queue status will not indicate a queue reset, so any user 
defined external circuits monitoring the queue should also be reset by the system reset. 
22K ohm pull-up resistors should be connected to the CPU command and bus control 
lines; this will guarantee 
the inactive state of these lines in systems where leakage cur- 


rents (or bus capacitance) 
may cause the voltage levels to settle below the minimum 
high voltage of devices in the system. In maximum 
mode systems, the 8288 contains 


internal pull-ups on the SO-S2 inputs; this maintains 
the inactive state for these lines 
when the CPU floats its bus. The high state of the status lines during a reset causes the 
8288 to treat the reset sequence as a passive state. The condition of the 8288 outputs for 
the passive state are shown in Table 8-5. 


Table 8-5. 8288 Outputs 


during Passive State 


ALE 
0 
DEN 
0 
DT/Fi 
1 
MCE/PDEN 
Oil 
Commands 
1 


If a reset occurs during a bus cycle, the status lines will return to the passive state, 


the bus cycle will end, and the command lines will become inactive. Note that the 8288 
does not float the command outputs based on the passive state of the status lines. If the 
designer 
needs to disconnect 
the CPU from the bus during 
reset in a single-CPU 
system, the reset signal should also be connected to the 8288's AEN input, and the out- 
put enable of the address latches, as illustrated in Figure 8-17. 
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This technique 
forces the command 
and address bus interface to float while the 
inactive state of DEN from the 8288 floats the transceivers 
on the data bus. 


In multiple 
processor 
systems 
using arbitration 
to establish 
microprocessor- 


shared bus connections, 
the system reset should be connected to the INIT input of the 
8289 Bus Arbiter in addition to the 8284 reset input, as shown in Figure 8-18. 
The active low INIT input forces all 8289 outputs to their inactive state. The inac- 
tive state of the 8289 AEN output will force the 8288 to float the command 
outputs; 
also, the address latches will float the address bus interface. For multi-microprocessor 
systems where more than one microprocessor 
can function 
as the master, 
the reset 
should be common to all CPUs, 8289s, and 8284s; this reset must satisfy the maximum 
of either the CPU reset requirements, 
or three 8289 bus clock period times (TBLBL) 
plus three 8086 clock period times. This will satisfy 8289 reset requirements. 
The 8288 command 
outputs are floated during reset, the command 
lines should 
be pulled up to Vcc through 2.2K ohm resistors. 
The reset signal to the 8086 can be derived from the 8284. The 8284 has a Schmitt 
trigger input for generating 
reset from an active low external 
reset. The hysterisis 
specified in the 8284 data sheet implies that at least 0.25 volts will separate the 0 and 1 
switching point of the 8284 reset input. Inputs without hysterisis will switch from low to 
high and high to low at approximately 
the same voltage threshold. 
The inputs are 
guaranteed 
to switch at specified low and high voltages (VIL and VIH), but the actual 
switching 
point is anywhere 
in between. 
Since VILmin 
is specified at 0.8 volts, the 
hysterisis guarantees 
that the reset will be active until the input reaches at least 1.05 
volts. A reset will not be recognized until the input drops at least 0.25 volts below the 
reset input's VIH of 2.6 volts. 


Figure 8-18. 
Reset Disable for Maximum 
Mode 


8086 Bus Interface in Multi-CPU System 


To guarantee 
reset from powerup, the reset input must remain below 1.05 volts 
for 50 microseconds after Vcc has reached the minimum supply voltage of 4.5 volts. The 
hysterisis allows the reset input to be driven by a simple RC circuit as shown in Figure 8- 
19. 


The calculated RC value does not include time for the power supply to reach 4.5 
volts, or the charge accumulated 
during this interval. Without the hysterisis, the reset 


output might oscillate as the input voltage passes through the switching voltage of the 
input. The calculated 
RC value provides 
the minimum 
required 
reset period of 50 


microseconds 
for 8284s that switch at the 1.05 volt level, and a reset period of approx- 
imately 162 microseconds 
for 8284s that switch at the 2.6 volt level. If tighter tolerance 


between the minimum and maximum reset times is necessary, the reset circuit shown in 
Figure 8-20 might be used, rather than the simple RC circuit. 


The circuit illustrated in Figure 8-20 provides a constant current source and linear 


charge rate on the capacitor, rather than the inverse exponential 
charge rate of the RC 
circuit. The maximum 
reset period for this implementation 
is 124 microseconds. 


The 8284 synchronizes 
the reset input with the CPU clock to generate the reset 
signal to the CPU, as illustrated in Figure 8-21. 


The output is also available as a general reset to the entire system. The reset has 


no effect on any clock circuits in the 8284. 
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The 8086 uses the READY signal to accommodate 
memory and I/O devices that 
cannot transfer information at the maximum CPU bus band width. READY is also used 
in multi-microprocessor 
systems to force the 8086 to wait for access to the system bus. 
To insert a wait state in a bus cycle, the READY signal to the CPU must be inactive 
(low) by the end ofT2. To avoid insertion ofa wait state, READY must be active (high) 
within a specified setup time prior to the positive transition during T3. Depending on the 
size and characteristics 
of the system, READY logic may take one of two approaches: 


(1) 
The system is normally not ready. When the selected memory or I/O device 
is ready to perform the data transfer, 
it inputs a high READY signal. 


(2) 
The system may normally be ready. If the selected memory or I/O device is 
not able to perform the data transfer at the maximum 
CPU transfer rate, it 


must then input a low READY signal. 


The "classical" 
READY implementation 
keeps the system "normally 
not ready." 


When the selected device receives a read, write, or interrupt acknowledge command, 
if 


it has had sufficient time to respond to this command, 
it inputs READY high to the 


8086; this allows the 8086 to advance the bus cycle. This implementation 
is charac- 
teristic of large multi-microprocessor, 
multibus systems, or systems where propagation 


delays, bus access delays, and device characteristics 
inherently 
slow down the system. 


Using this technique, 
devices that can run with no wait states must return READY high 


within the previously described limit for maximum 
system performance. 
Failure of a 


fast device to respond in time will cause wait clock periods to be inserted in the bus cycle. 
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An alternate 
technique 
is to have the system "normally 
ready." 
All devices are 
assumed to operate at the maximum CPU bus band width. Devices that do not meet the 
requirement 
must input READY low by the end of T2 to ensure that wait state clock 
periods will be inserted. This implementation 
is typically applied to small, single CPU 
systems; it reduces the logic required to control the READY signal. Since the failure of 
the device requiring wait states to disable READY by the end of T2 will result in pre- 
mature termination 
of the bus cycle, system timing must be carefully analyzed when 
using this approach. 


It will be shown in Chapter 
10 that the 8086 system allows the designer to com- 
bine the two READY techniques described above in a single system in order to optimize 
system performance. 


The 8086 has two different timing requirements 
for READY, depending 
on the 
system 
implementation. 
For a "normally 
not ready" 
system, 
to avoid wait states, 
READY must be high within 119 ns (TRYHCH) 
of the positive clock transition during 
T3. This is illustrated in Figure 8-22. 


A "normally 
ready" system inserts a wait state by inputting READY low within 8 
ns (TRYLCL) 
after the end ofT2 
(start ofT3), 
as illustrated in Figure 8-23. 


To guarantee 
proper operation of the 8086, the READY input must not change 
from high to low during the clock low time of T3. In both cases READY must satisfy a 
hold time of 30 ns (TCHR YX) from the T3 positive clock transition. 
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To generate 
a stable READY signal that satisfies the previous setup and hold 
times, the 8284 provides two separate system ready inputs (RDYl, 
RDY2) and a single 
synchronized 
ready output 
(READY). 
The RDY inputs are gated with separate access 
enables (AENl, 
AEN2); this allows one of the two READY signals to be selected, as 
illustrated in Figure 8-24. The gated RDY signals are logically ORed by the 8284, and 
sampled at the beginning of each CLK cycle to generate READY to the CPU. This tim- 
ing is illustrated in Figure 8-25. 
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The sampled READY signal is valid within 8 ns (TR YLCL) after CLK to satisfy 
the CPU timing requirements 
on "not 
ready" 
and "ready." 
Since READY 
cannot 
change until the next CLK, the hold time requirements 
are also satisfied. The system 
ready inputs to the 8284 (RDYl, 
RDY2) must be valid 35 ns (TRIVCL) before n,and 
AEN must be valid 60 ns before T3. For a system using only one ROY input, the associ- 
ated AEN is tied to ground while the other AEN is connected to five volts through 
IK 
ohms, as illustrated in Figure 8-26. 
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If the system generates a low active ready signal, it can be connected to the 8284 
AEN input, 
providing 
the additional 
setup time required 
by the 8284 AEN input is 
satisfied. In this case, the associated RDY input would be tied high, as illustrated 
in 
Figure 8-27. 
The majority of memory and peripheral devices that operate at less than the max- 
imum CPU frequency 
typically do not require more than one wait state. The circuit 
given in Figure 8-28 generates a single wait state. 
The system ready line in Figure 8-28 is driven low whenever 
a device requiring 
one wait state is selected. The flip-flop is cleared by ALE, enabling RDY to the 8284. If 
no wait states are required, the flip-flop does not change. If the system ready is driven 
low, the flip-flop toggles on the low-to-high 
clock transition 
of T2, to force one wait 
state. The next low-to-high 
transition 
of CLK toggles the flip-flop again, to indicate 
ready and allow completion of the bus cycle. Further changes in the state of the flip-flop 
will not affect the bus cycle. The circuit allows approximately 
100 ns for chip select to 
system ready, as illustrated in Figure 8-29. 
If the system is "normally 
not ready" programs should not assign executable code 
to the last six bytes of physical memory. Since the 8086 prefetches instructions, 
the CPU 
may attempt to access non-existent 
memory when executing code at the end of physical 
memory. If the access to non-existent 
memory fails to enable READY, the system will 


be caught in an indefinite wait. 
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The 8086 interrupt 
structure 
is based on a table of interrupt 
vectors stored 
in 
memory locations 016 through 
003FF16, 
as illustrated 
in Figure 8-30. Each vector has 
four bytes; the first two bytes hold a new program counter address, the next two bytes 
hold a new Code Segment register address. These two addresses combine to form the 
20-bit execution address of an interrupt service routine. This 20-bit address is computed 
using normal 8086 segmented 
program memory addressing. The interrupt 
vector table 
may contain up to 256 interrupt 
vectors, specifying starting addresses for interrupt 
ser- 
vice routines residing anywhere in the one megabyte address space of the 8086. If a par- 
ticular 
configuration 
uses fewer than 
256 interrupts, 
then 
you need 
only allocate 
memory for those interrupt vectors that are used. But when a system is being debugged, 
you should assign all undefined 
interrupts 
to a trap routine 
as a means of detecting 
erroneous 
interrupts. 
Each interrupt 
vector has an associated interrupt number. 
The interrupt 
number 
identifies the interrupt 
vector within the interrupt 
vector table. The interrupt 
number, 
multiplied by four, gives the absolute address for the first byte of the interrupt 
vector's 
entry within the interrupt vector table. For example, interrupt number five points to the 
sixth entry in the interrupt vector table; the first byte of this vector has the address 2010 
(= 1416), This is illustrated in Figure 8-30. 


The 8086 interrupt 
structure 
thus allows you to specify the starting 
memory 
address for every interrupt 
service routine. 
The 8086 has three types of interrupts: 
predefined 
interrupts 
that are requested 
by specific functions 
within the 8086, user defined hardware interrupts, 
and software 
interrupts. 
Predefined 
interrupts 
can be requested 
by hardware 
and/or 
software. 
Let us 
examine predefined 
interrupts 
in detail. 


"Predefined" 
interrupts are so named because they have assigned interrupt num- 
bers 
and 
automatic 
vectoring 
logic. Therefore, 
when 
a predefined 
interrupt 
is 
acknowledged, 
8086 logic automatically 
vectors to the interrupt's 
assigned vector table 
entries. 
However, 
you must initialize these vector table entries with program counter 
and code segment addresses, and you must provide each interrupt with its interrupt ser- 
vice routine. 
There are predefined hardware interrupts, 
which are requested 
by external logic, 


and there are predefined software interrupts, 
which are requested 
in consequence 
of an 
instruction's 
execution. 
Interrupt 
numbers 
0 through 
31 have been assigned to predefined 
interrupts. 
If 
you do not use a predefined interrupt, 
you can use the interrupt number for some other 
interrupt. 
But this is not recommended, 
since it may result in your system being incom- 
patible with future 8086 hardware and software products. 
We will now describe predefined interrupts, 
one at a time. 
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Figure 8-30. 
Obtaining the Interrupt 
Service Routine 
Address 
from the Interrupt 
Vector Table 


This interrupt 
is automatically 
requested 
if, following execution 
of the division 
instruction, 
the quotient 
exceeds 
the maximum 
value that the division 
instruction 
allows. The interrupt 
is non-maskable. 
It is requested 
as a part of standard 
divide 
instruction 
execution 
logic. If interrupts 
are not reenabled 
by the divide by zero inter- 
rupt service routine, then this service routine's 
execution time should be included when 
computing 
the "worst-case" 
divide instruction 
time. This becomes the longest execu- 
tion time for the divide instruction. 


This interrupt 
occurs one instruction 
after TF (the Trap Flag) is set in the Pro- 
gram Status Word. This interrupt is used to execute programs one instruction 
at a time. 
After each program instruction 
is executed, 
an interrupt 
is requested. 
Following the 
interrupt request, 
various diagnostic capabilities might be provided by an interrupt 
ser- 
vice routine, at the conclusion of which the next program instruction is executed - 
and 
another 
single stepping interrupt 
request occurs. 


To initiate single stepping, push the Program Status Word contents onto the stack; 
then set the Trap Flag bit within the saved Program Status Word at the top of the stack 
and pop the stack back to the Program Status Word. A single stepping interrupt will be 
requested 
following the next instruction's 
execution. 
When a single stepping interrupt 
request is acknowledged 
the TF flag is reset in 
the Program Status Word to prevent the single stepping interrupt 
service routine itself 
from being interrupted 
by a single step interrupt 
request. 
TF remains set in the flags 
saved in the stack. 


You should use the IRET instruction to return from a single step interrupt service 
routine. This return will restore the flags (including TF) and allow another TF interrupt 
to occur on completion 
of the next instruction. 


This is the highest priority hardware interrupt. 
As its name would imply, it is non- 
maskable. 
The NMI interrupt 
request 
input is edge triggered 
by a low-to-high 
NMI 
input transition, 
and is internally synchronized 
with a low-to-high transition of the CPU 
clock signal CLK. NMI must then remain high for at least two clock periods to guarantee 
recognition. Since any low-to-high transition of the NMI input can generate an interrupt 
request, 
spurious transitions 
must be suppressed. 
IfNMI 
is normally high, it must be low for two CPU clock periods before making 
its active low-to-high transition in order to guarantee recognition. This input is typically 
reserved for catastrophic interrupt requests, for example, following a power failure, or if 
a system watchdog timer times out. 


This is a software interrupt. 
It is generated by executing a special interrupt request 
instruction that occupies a single byte of object code. This interrupt instruction is used to 
set breakpoints 
in software debug programs. Since the smallest 8086 instruction 
object 
code is one byte, the one byte interrupt can replace any 8086 instruction 
as a means of 
setting breakpoints. 
The one byte interrupt 
is not maskable. 


This interrupt 
request 
occurs if the Overflow Flag (OF) is set in the Program 
Status Word, and the INTO instruction 
is executed. 
The INTO instruction 
allows the 
8086 to trap to an overflow error service routine. Interrupt 
on overflow is non-maska- 
ble. 


You can generate a software interrupt by executing the two byte interrupt INT nn 
instruction. 
The first object code byte is the INT op-code; the second object code byte 
(nn) contains the number 
of the interrupt 
to be executed. 
The INT instruction 
is not 
maskable. 


This instruction 
is frequently used to call dynamically relocatable programs; the 
called program's 
location in memory is not known by the calling program. However, 


when the called program is loaded into memory, its execution address is loaded into its 
interrupt 
vector. The called program 
must 
return 
with an interrupt 
return 
(IRET) 
instruction. 


Maskable hardware interrupts 
are requested 
via the INTR pin of the 8086; these 
interrupts 
can be masked by the IF bit (Interrupt 
Flag) of the Program Status Word. 


During the last clock period of each instruction's 
execution, 
the state of the INTR pin is 
sampled. There are two exceptions to this rule: 


1. 
When the instruction 
is a MOV to a segment register or a POP to a segment 
register. 


2. 
During 
execution 
of an instruction 
prefix, which is treated 
as part of the 
instruction 
it precedes. 


These two exceptions 
will be discussed following a description 
of the "general 
case" interrupt 
acknowledge execution 
sequence. 


We will describe 
the interrupt 
acknowledge 
sequence, 
taking the user defined 


hardware interrupt 
as the "general 
case." 
If the INTR signal is high when sampled, and the IF bit in the Program Status 


Word is 0, then 
a user defined 
interrupt 
has been requested; 
these interrupts 
are 


enabled, 
so the 8086 executes 
an interrupt 
acknowledge 
sequence. 
To guarantee 
the 


interrupt 
has been acknowledged, 
the INTR input must be held high until the 8086 


returns an interrupt acknowledge, via INT A in a minimum system, or via SO,S1, and S2 
in a maximum 
system. 


If the BIU is a bus cycle when the interrupt condition is detected as would occur if 


the BIU is fetching an instruction 
when the current instruction 
completes executing, 


then the interrupt request must be valid at INTR for two clock periods prior to T4 of the 
bus cycle, otherwise another 
bus cycle will be executed 
(if one is pending) before the 
interrupt 
acknowledge is issued. 
If a hold request is pending, as might occur if an interrupt and hold are requested 


while a locked instruction 
is executing, 
then hold is serviced first, and the interrupt 
is 
acknowledged 
after the hold has been serviced. 


Only user defined hardware interrupt requests occurring at the INTR pin receive a 


specific hardware 
acknowledge. 
This acknowledge 
takes the form of two interrupt 
acknowledge bus cycles, separated by two idle clock periods, as illustrated in Figure 8- 
31. Software interrupts 
and non-maskable 
interrupts 
do not receive the acknowledge 
sequence illustrated 
in Figure 8-31. 


The complete interrupt acknowledge sequence, 
as illustrated in Figure 8-30, con- 


sists of two INT A bus cycles, separated by two idle clock periods. During the two bus 
cycles, INT A is output 
low (in minimum 
mode) 
to acknowledge 
the interrupt. 
The 
address/data 
bus (including BHE), and the associated status (S3-S7) is floated during 
both bus cycles; however, a high ALE pulse is output, so address latches will be loaded 
with indeterminate 
information. 
Therefore devices should always use READ (RD) low 
as a qualifier before driving their outputs. 
During the INTA bus cycles, DT/R and DEN are active; this allows the 8086 to 
receive a one-byte interrupt 
number 
from the device requesting 
the interrupt. 


The first INT A bus cycle signals that an interrupt acknowledge is in progress; this 
allows the interrupting 
device time to ready its interrupt number for transmittal 
during 
the next INT A bus cycle. The interrupt number must be transferred 
to the 8086 on the 
lower half of the 16-bit data bus during the second INT A bus cycle. Therefore 
devices 
that supply interrupt 
vectors must connect to the lower half of the 16-bit data bus. 


Timing for INT A bus cycles (with the exception of address timing) is the same as 
read bus cycle timing. 
Note that the 8086 interrupt 
acknowledge sequence deviates from the 8080 and 


the 8085 in that no instruction 
is read by the CPU during the interrupt 
acknowledge 
sequence. The 8080 and 8085 require either a restart or a call instruction 
to be issued by 
the interrupting 
device to the CPU as part of the acknowledge sequence. 


In the minimum 
mode system, 
the MilO 
signal will be low during 
interrupt 
acknowledge bus cycles. 
The 8086 prevents the BIU from honoring a hold request occurring between the 


two INT A cycles. 
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In a maximum mode system, status lines SO-S2will cause the 8288 Bus Controller 
to output INT A low during each interrupt acknowledge bus cycle. The LOCK output of 
the 8086 will be active from T2 of the first interrupt 
acknowledge bus cycle until T2 of 
the second interrupt 
acknowledge bus cycle to prevent the 8086 from honoring a hold 
request on either RQ/GT input, and to prevent bus arbitration logic from relinquishing 
the bus between 
the two interrupt 
acknowledge 
bus cycles in multimaster 
systems. 
READY logic functions identically in interrupt 
acknowledge, 
read or write bus cyles. 
The 8086 will not sample INTR after a MOV to a segment register or POP to a seg- 
ment register; this allows a 32-bit pointer to be loaded into the Stack Pointer SS and SP 
registers without the possibility of an interrupt 
separating the two loads. 
Here is an example of an uninterruptable 
instruction 
sequence: 


NEW$STACK$SEGMENT 
NEW$ST ACK$POINTER 


The 8086 will not sample 
INTR 
after executing 
an instruction 
prefix, 
since 
prefixes are treated as part of the instruction 
they precede. The one exception to this 
rule occurs when a string primitive is preceded by the Repeat 
(REP) prefix. The re- 
peated string operation 
will sample INTR after completing each repeated string primi- 
tive's execution. This includes repeat string operations having a LOCK prefix. If multi- 
ple prefixes precede a repeated string operation, and the instruction 
is interrupted, 
only 
the prefix immediately preceding the string primitive is restored following a return from 
the interrupt routine. To allow correct resumption 
of program execution you should use 
the following programming 
technique: 


LOCK 
REP 
MOVS 
AND 
JNZ 


DEST,CS: 
SOURCE 
CX,CX 
LOCKED$BLOCK$MOVE 


The object code bytes generated 
for the MOVS instruction 
are (in descending 
order) LOCK prefix, REP prefix, Segment Override prefix, and MOVS. Upon return 
from the interrupt, 
the segment override prefix is restored to guarantee that one addi- 
tional transfer will occur between the correct memory locations. The instructions follow- 
ing the move test the repetition count value to determine 
if the move was completed; 
a 


return to the block move instruction 
occurs if the move was not completed. 


The 8086 reads the interrupt 
number 
from the bus for hardware interrupts 
and 


from the instruction 
stream for software interrupts. 
The interrupt number is multiplied 


by four to generate the address of the corresponding 
interrupt 
vector in the interrupt 
vector table. The four bytes of the interrupt 
vector are: 


Least significant byte for the program counter. 


Most significant byte for the program counter. 


Least significant byte for the Code Segment register. 


Most significant byte for the Code Segment register. 


Next the 8086 pushes the Program Status Word contents 
onto the stack, resets 


the trap and interrupt flags, then pushes the current Code Segment register and the pro- 
gram counter 
contents 
onto the stack. The new Code Segment register and program 
counter contents are loaded from the interrupt vector table; read bus cycles are executed 
for this to occur. 


No segment registers are used when referencing the interrupt vector table during 
the interrupt 
acknowledge sequence. The vector displacement 
is added to zero to form 
the 20-bit address; S4 is 1 and S3 is 0, indicating no segment register selection. 


This is the actual bus sequence executed when a user defined, maskable interrupt 
is acknowledged: 


1. 
Two interrupt 
acknowledge 
bus cycles are executed, 
separated 
by two idle 
clock periods. As illustrated in Figure 8-30, the acknowledged device returns 
an 
interrupt 
number, 
as a byte 
of data, 
during 
the 
second 
interrupt 
acknowledge bus cycle. This data byte, shifted left two bit positions, becomes 
the interrupt 
vector starting address. 


2. 
A read bus cycle is executed, 
during which new CS register contents are read 
from the first two interrupt 
vector bytes. 


3. 
A read bus cycle is executed, during which new program counter contents are 
read from the third and fourth interrupt 
vector bytes. 


4. 
A write bus cycle is executed, 
during which the Program Status Word con- 
tents is pushed onto the stack. 


5. 
The Interrupt en and Test (TF) flags in the Program Status Word are reset to 
O. This disables maskable or single step interrupts. 


6. 
A write bus cycle is executed, during which the CS register contents is pushed 
onto the stack. 


7. 
A write bus cycle is executed, 
during which program counter 
contents 
are 
pushed onto the Stack. 


Program 
execution 
now 
branches 
to the 
interrupt 
service 
routine - 
whose 
address has been fetched from the interrupt 
vector. 


When a non-maskable 
interrupt, 
a software interrupt, 
or a single step interrupt 
is 
acknowledged, 
steps 2 through 
7 above are executed; 
step 1 is not needed since the 
interrupt 
number 
is known. 


62 clock periods separate the end of the instruction 
during which a user defined 
maskable interrupt 
is requested, 
and the start of interrupt 
service routine execution. 


The same sequence 
of bus cycles is executed 
for software generated 
interrupts, 
except that no interrupt acknowledge bus cycles are executed. In consequence, 
the delay 
to execution 
of the interrupt 
service routine is 51 clock periods for INT nn and single 
step, 52 clock periods for INT3, and 53 clock periods for INTO. 


If wait states are inserted in any bus cycle, the number 
of interrupt 
acknowledge 
clock periods given above will, of course, increase accordingly. 


Let us now examine multiple interrupts 
and interrupt 
priorities. 
Only external 
interrupts 
requested 
via INTR can be disabled. In consequence, 


these 
interrupts 
have lowest priority. 
Any other 
interrupt's 
acknowledge 
sequence 
resets the IF flag in the Program 
Status Word. 
An interrupt 
requested 
via INTR, 
therefore, 
cannot be acknowledged until other interrupt service routines are completed 
or interrupts 
are reenabled 
(IF flag is set). 


A 
program 
being 
debugged 
using 
single 
stepping 
could 
be 
modified 
to 
acknowledge external, user defined interrupts 
only within the single step interrupt ser- 
vice routine. 
This will allow the external 
interrupts 
to be serviced quickly in spite of 
single stepping. To do this requires that the single step interrupt service routine reset the 
IF flag for the interrupted 
program, which will be in the Program Status Word stored in 
two words from the top of the stack, and enable interrupts during the single step routine. 
This may be illustrated as follows: 


rSingle 
Step prog0--j 
r-Single 
Step Prog0--j 


-......... 
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..........- 
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• 
Interrupts enabled 


R Interrupts disabled 


On the other hand, you may wish to single step the interrupted 
program only - 
or 


the external, user defined interrupt's 
service routine only. If the TF flag in the Program 
Status Word is set to 1 by the interrupted 
program, then the interrupted 
program will be 
subject to single stepping; otherwise it will not. In either case, a user defined interrupt's 
service routine will begin execution with TF reset to 0, and single stepping consequently 
disabled. Program logic within the interrupt service routine must therefore enable single 
stepping for the duration 
of the interrupt 
service routine's 
execution. 
You can, if you wish, disable INTR within single stepping traps. This requires that 
the single stepping interrupt 
service routine keep the IF flag reset to 0 within the Pro- 
gram Status Word. Bus disabling INTR, for what could become a long time, might dis- 
rupt your program logic. 


We will now examine non-maskable 
interrupt priorities. We have described three 
such interrupts: 
NMI, single stepping, and software traps. All have priority over user 
defined external interrupts 
requested 
via INTR. Among themselves, 
when two of the 
three 
non-maskable 
interrupts 
occur 
simultaneously, 
single 
stepping 
has 
highest 
priority, followed by NMI, with software traps having lowest priority. But when all three 
non-maskable 
interrupts 
are requested 
simultaneously, 
NMI 
has highest 
priority, 
followed by software traps, with single stepping acquiring lowest priority. 


Since single stepping may have higher priority than NMI, or lower priority, the 
single stepping interrupt service routine will have to examine whether its execution does 
or does not follow an NMI interrupt. 
If it does follow an NMI interrupt and you wish to 
immediately service the NMI, the single stepping interrupt service routine must contain 
logic to disable itself. This program logic will examine the return address at the top of the 
stack, and upon detecting an NMI interrupt service routine address, it need only return, 
allowing the NMI routine to execute. The NMI routine will return to the program being 
single stepped and single stepping is automatically 
reenabled by restoring the flags dur- 
ing the return. The net affect is: if the NMI is detected, single stepping is bypassed for 
one instruction 
of the program being single stepped. Since single stepping is disabled 
during the interrupt 
acknowledge process, the NMI interrupt service routine need only 
keep the TF flag reset to 0 within its Program Status Word in order to disable single step- 
ping for the duration 
of its execution. 


The 
8259A 
Priority 
Interrupt 
Controller 
can handle 
multiple, 
external 
user 
defined interrupts 
requested 
via INTR. This device will operate in 8080A/8085 
or 8086 
systems. The 8259A is cascadable; in master/slave 
configurations 
it will handle up to 64 
interrupts 
within a single system. 
Figures 8-32 and 8-33 illustrate 8259As in minimum 
and maximum 
mode 8086 
systems. 
The minimum 
mode configuration 
illustrated 
in Figure 
8-32a shows an 8259A 
connected to an 8086 multiplexed bus. The configuration 
shown in Figure 8-32b illustr- 
ates an 8259A connected 
to a demultiplexed 
bus system. These interconnections 
are 
also applicable to maximum 
mode systems. The configuration 
given for a maximum 
mode system shows a master 8259A on the 8086 multiplexed bus, with additional slave 
8259As out on the buffered system bus. This configuration demonstrates 
several unique 
characteristics 
of the maximum 
mode system interface. If the master 8259A receives 
interrupts 
from a mix of slave 8259As and regular interrupting 
devices, the slaves must 
provide the interrupt 
numbers 
for devices connected 
to them, while the master must 
provide the interrupt 
numbers 
for devices directly attached to its interrupt 
inputs. The 
master 8259A can determine 
if an interrupt is being received directly from the request- 
ing device or from a slave 8259A. The master 8259A uses this information 
to enable or 
disable data bus transceivers 
(via the NAND function of DEN and EN). If the master 
8259A must provide the interrupt number, 
it will disable the data bus transceivers. 
If a 
slave 
8259A 
must 
provide 
the 
type number, 
the 
8086 will enable 
the 
data 
bus 
transceivers. 
The EN output is normally high, allowing the 8086/8288 to control the bus 
transceivers. 
To select the proper slave when servicing a slave interrupt, 
the master 
must provide a cascade address (CAS) to the slave. If the 8288 is not strapped in the I/O 
bus mode (the 8288 lOB input connected to ground), 
the MCE/PDEN 
output becomes 
an MCE or Master Cascade Enable output 
(use of the I/O bus mode is explained 
in 
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Figure 8-32a. 
82595 Connected 
to a Minimum 
Mode 8086 
- 
Multiplexed 
Bus 
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Figure 8-32b. 
82595 Connected to a Minimum 
Mode 8086 


- 
Demultiplexed 
Bus 


Chapter 
10). MCE is active only during INT A cycles, as shown in Figure 8-34. MCE 


enables the master 8259A's cascade address onto the 8086's local bus during ALE. 


This allows the address latches to capture the cascade address with ALE, with the 


system address bus being used to select the proper slave 8259A. MCE is gated with 
LOCK to minimize local bus contention 
between the 8086 floating its bus output, 
and 
the cascade address (CAS) being enabled onto the bus. The first INT A bus cycle allows 
the master 8259A to resolve internal priorities and output of the cascade address (CAS), 
which is transmitted 
to the slaves during the second {NTA bus cycle. For additional 
information 
on the 8259A, refer to Intel Application Note AP59, or An Introduction to 


Microcomputers - 
Volume 2, Some Real Microprocessors~ by A. Osborne. 
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I INTERPRETING THE 8086 
BUS 
TIMING 
DIAGRAMS 


8086 minimum 
and 
maximum 
mode 
bus timing 
diagrams 
are shown 
in the data 
sheets 
at the back of this book. 
These 
timing 
diagrams 
may be divided 
into six sections 
as follows: 


1. 
Address 
and 
ALE 
timing 


2. 
Read 
cycle timing 


3. 
Write 
cycle timing 


4. 
Interrupt 
acknowledge 
timing 


5. 
Ready 
timing 


6. 
Bus control 
transfer 
timing 


Since the A.C. characteristics 
of the signal are specified 
relative 
to the CPU clock, 


the relationship 
between 
the majority 
of signals 
can be deduced 
by simply 
determining 
the 
clock 
periods 
that 
separate 
the clock 
edges 
to which 
the 
signals 
are relative, 
then 


adding 
or 
subtracting 
the 
appropriate 
minimum/maximum 
parameter 
values. 
One 
aspect 
of system 
timing 
not compensated 
for in this approach 
is the "worst 
case" 
rela- 
tionship 
between 
minimum 
and 
maximum 
parameter 
values 
(also 
known 
as tracking 
relationships). 
For 
example, 
consider 
a signal 
that 
has 
specified 
minimum 
and 
max- 
imum 
turn 
on and turn 
off delays. 
Depending 
on device 
characteristics, 
it may not be 
possible 
for 
a component 
to simultaneously 
demonstrate 
a maximum 
turn-on 
and 
minimum 
turn-off 
delay, 
even 
though 
worst 
case analysis 
might 
imply 
this possibility. 
This 
argument 
is characteristic 
of MOS devices 
and is therefore 
applicable 
to the 
8086 
A.C. 
characteristics. 
The 
message 
is: worst 
case 
analysis 
mixing/minimum 
and 
max- 
imum 
delay 
parameters 
will typically 
exceed 
the worst 
case obtainable. 
Therefore 
they 
should 
not be subjectively 
degraded 
further, 
to obtain 
worse-worst 
case values. 
We will 


now examine 
guidelines 
for specific 
areas 
of 8086 timing 
that 
are sensitive 
to tracking 
relationships. 


The address/ALE 
timing relationship 
is important 
since it determines 
a device's 
ability to capture a valid address from the multiplexed 
bus. Since the 8282 and 8283 


latches capture the address on the trailing edge of ALE, the critical timing involves the 
state of the address lines when ALE terminates. The parameter TAVAL = TCLCH - 60 
ns guarantees that addresses are valid at the CPU 58 ns before the trailing edge of ALE. 
This satisfies the zero data setup time to end of strobe required by the 8282/8283 and 
assures 
that a valid address 
is captured. 
The address 
is guaranteed 
to remain 
valid 
beyond the end of ALE by the TLLAZ parameter. This specification overrides the rela- 
tionship between TCHLL and TCLAX, which might seem to imply that the address may 
not be valid by the latest possible ALE. TLLAZ timing applies to the entire address bus, 
even 
though 
only shown 
for A19-A16 
in the timing 
diagram. 
The TCLAX 
min 
specification on the address indicates the earliest possible time the bus will float if not 
restrained 
by a slow ALE. TCLAX only applies to the multiplexed 
address/data 
lines 
AD15-0 during read cycles. During write cycles, the multiplexed 
Address/Data 
Bus 
switches directly from address to write data. Address hold time to ALE is again guaran- 
teed by the TLLAZ specification, with the absolute minimum 
(for the case of an early 
ALE termination) 
specified by TCLAX. For both the read and write case, the A19-A16 
lines switch directly from address to status with the same timing as the multiplexed 
address/data 
bus for the write case. The minimum 
ALE pulse width is guaranteed 
by 
TLHLL min, which takes precedence over the value obtained by relating TCLLH max 
and TCHLI min. 


To determine 
the worst delay to valid address on a demultiplexed 
address bus, 
two paths must be considered: 


1. 
Delay of valid address 


2. 
Delay of ALE 


Since the 8282 and 8283 are flow through latches, a valid address is not transmitted 
to 
the address bus until ALE is active. A comparison of address valid delay TCLA V max, 
with ALE active delay TCLLH max indicates TCLA V max is the worst case. Subtracting 
the latch propagation delay gives the worst case address bus valid delay from the start of 
the bus cycle. 


Read cycle timing consists of three parts: 


1. 
Conditioning 
the bus 


2. 
Activating the Read Control signal 


3. 
Establishing the data transceiver 
enable and direction controls 


If the memory or I/O devices are connected directly to the multiplexed 
address/ 
data bus, the TAZRL parameter 
guarantees 
that the 8086 will float the bus before 
activating the read control and allowing the selected device to drive the bus. At the end 
of the bus cycle, the TRHA V parameter 
specifies the bus float delay the device being 
deselected must satisfy if it is to avoid contention 
with the 8086 driving the address for 
the next bus cycle. The next bus cycle may start during the CLK period following T4 or 
any number 
of CLK periods later. 


The minimum 
delay from read active to valid data at the CPU is 2TCLCL 
- 


TCLRL max - TDVCL = 205 ns. The minimum pulse width is 2TRLRH which gives a 
minimum 
pulse width of 325 ns. 
DT /R: is established 
early in the bus cycle and requires no further consideration. 


During a read, the DEN signal must allow the transceivers to propagate data to the 
CPU with the appropriate 
data set up time, and continue to do so for the required hold 
time. The DEN turn on delay allows TCLCL + TCHCL 
min - 
TCVCTV max - 


TDVCL = 127 ns transceiver enable time prior to valid data required by the 8086. Since 
the 8086 data hold time TCLDZ min and the minimum 
DEN turn off delay TCVCTX 
min are both 10 ns relative to the same clock edge, the hold time is guaranteed. 
Addi- 
tionally, DEN must disable the transceivers 
prior to the 8086 driving the bus with the 
address for the next bus cycle. The maximum 
DEN turn off delay (TCVCTX max), 
compared with the minimum 
delay for the addresses 
out of the 8086 (TCLRH min), 
indicates the transceivers 
are disabled at least 55 ns before the CPU drives the address 
onto the multiplexed 
bus. 


The write cycle consists of three major functions: 


I. 
Providing write data to the system 


2. 
Generating 
the write command 


3. 
Controlling 
data bus transceivers 


The Write Data and Write command 
are both enabled from the leading edge of 


T2. Comparing minimum 
WR active delay TCVCTV min with the maximum write data 


delay TCLDV indicates that write data may not be valid until 100 ns after write is active. 
Therefore, 
devices in the system should capture data on the trailing edge of the Write 
command rather than the leading edge to guarantee 
valid data. The 8086 floats the bus 
after write only if forced ofTthe bus by a HOLD or RQ/GT input, otherwise the 8086 
simply switches the output drivers from data to address at the beginning of the next bus 
cycle. As with the read cycle, the next bus cycle may start in the clock period following 
T4 or any later clock period. 
Data from the 8086 is valid a minimum 
of 2TCLCL - TCLDV max + TCVCTX 


min = 300 ns before the trailing edge of WRITE. The minimum 
WRITE pulse width is 


TWLWH = 340 ns. The CPU maintains 
valid write data TWHDX ns after write. The 


TWHDZ specification overrides the result derived by relating TCLCH min and TCHDZ 
min, which implies write data may only be valid 18 ns after WR. The TCHDZ minimum 
bus float time takes effect only if TCVCTX + TWHDZ < TCLCH + TCHDZ. 


The transceiver 
direction control signal DT/R: is conditioned 
to transmit 
at the 
end of each read cycle; it does not change 
during 
a write cycle. This allows the 


transceiver 
enable signal DEN to be active early in the cycle, while addresses are valid, 


without corrupting the address on the multiplexed bus. DEN is disabled a minimum 
of 
TCLCH min + TCVCTX min - 
TCVCTX max = 18 ns after write, to guarantee 
data 


hold time to the selected device. Since we are again evaluating 
a minimum 
TCVTCX 
with a maximum TCVTCX, the real delay from the end of write to transceiver disable is 
approximately 
60 ns. 


The interrupt 
acknowledge sequence 
consists of two interrupt 
acknowledge 
bus 
cycles. Timing of each cycle is identical to read cycle timing, with two exceptions: control 
signal timing and address/data 
bus timing. 
The INT A control signal has the same timing as the WR Control signal. INT A is 
active within 110 ns of the start of T2, providing 260 ns of access time from control to 
data valid at the 8086. The INT A control is active following the leading edge of T4 for a 
minimum 
of TCVCTX min = 
10 ns, to satisfy the data hold time of the 8086. This 
insures that the minimum 
INT A pulse width is 300 ns; however, 
taking signal delay 
tracking into consideration 
(TCVCTX min = 50 if TCVCTX 
max = 
110), gives a 
minimum 
pulse width 
of 340 ns. Since the maximum 
inactive 
delay of INT A is 
TCVCTX max = 110 ns, and the 8086 will not drive the bus until 15 ns (TCLA V min) 
into the next clock cycle, 105 ns are available for interrupt 
devices on the local bus to 
float their outputs. 
If the data bus is buffered, DEN provides the same amount of time 
for local bus transceivers 
to float their outputs. 
The multiplexed address/data 
bus is floated from Tl at the beginning of the INT A 
cycle, within TCLAZ ns. The upper four multiplexed 
address/status 
lines do not float. 


The address value on A19-A16 is indeterminate, 
but the status information will be valid 
(S3=0, 
S4=0, 
S5=IF, 
S6=0, 
S7=BHE=0). 
The multiplexed 
address/data 
lines will 
remain floating until the clock period following T4 of the INT A bus cycle. This sequence 
occurs for both of the INTA bus cycles. The interrupt 
number read by the 8086 on the 
second INT A bus cycle must satisfy the data setup and hold times of a read cycle. 
The DEN and DT/R signals are enabled for each INT A cycle; they do not remain 
active between the two cycles. Timing for these two signals is identical in INT A and 
Read bus cycles. 


The detailed 
timing requirements 
of the 8086 READY 
signal and the system 
Ready signal (RDY) input into the 8284 were given earlier in this chapter. The system 
Ready signal (RDY) is typically generated from either the address decode of the selected 
device or the address and control signals RD, WR, INT A. 


If RDY is enabled by the address decode, there are two cases to consider. For a 
system which is normally not ready, the time to generate ready from a valid address and 
not insert a wait state is 2TCLCL - TCLA V max - 
TRIVCL max = 255 ns. This time 
is available for buffer delays and address decoding to determine 
if the selected device 
does not require a wait state and drive the RDY line high. If wait clock periods are 
required, user hardware must provide the appropriate ready delay. Since the address will 
not change until the next ALE, RDY will remain valid throughout 
the bus cycle. For a 
system which is normally ready, selected devices requiring wait states also have 255 ns 
to disable the RDY lines. User hardware must delay reenabling RDY by the appropriate 
number 
of wait state clock periods. 
IfRDY is enabled by the RD control, TCLCL - TCLRL max - TRIVCL max = 
15 ns are available for external logic. If the WR control is used, TCLCL - 
TCVCTV 
max - 
TRIVCL max = 55 ns are available. 
Comparison 
of RDY generated 
by an address 
or control signal indicates 
that 
address decoding provides the best timing. If the system is normally not ready, address 
decoding alone could be used to provide RDY for devices not requiring wait states, 
while devices requiring wait states may use a combination 
of address decode and control 
signals to activate a wait state generator. 
If the system is normally ready, devices not 
requiring wait states do nothing to RDY, while devices needing wait states should disa- 
ble RDY via the address decode, and use a combination 
of address decode and control 
signals to activate a delay until RDY is reenabled. 
If the system requires no wait states for memory, and a fixed number of wait states 
for RD and WR to all I/O devices, the MilO signal can be used as an early indication 
that wait state clock periods are needed. This allows a common circuit to control ready 
timing for the entire system, without feedback of address decodes. 


Detailed HOLD/HLDA 
timing is covered later in this chapter. 


The TEST input is sampled 
by the 8086 only during execution 
of the WAfT 
instruction. 
The TEST signal should be active for a minimum of six clock periods during 
the WAIT instruction 
to guarantee 
detection. 


The maximum mode 8086 bus operations are logically equivalent to the minimum 
mode operation. 
Detailed timing analysis now involves signals generated 
by the 8086 
CPU and the 8288 Bus Controller. 


In addition to supplying signals provided by a minimum mode 8086, the 8288 pro- 
vides additional control signals that expand the flexibility of the system. In the following 
discussion, 
when calculating signal relationships, 
be sure to use the proper maximum 
mode values, rather than equivalent 
minimum 
mode values. 


In maximum 
mode, address information 
continues 
to come from the 8086, but 
the ALE strobe is generated 
by the 8288 Bus Controller. 
To determine 
the worst case 
relationships 
between ALE and a valid address, activation of the 8288 ALE relative to 
the SO-S2 status from the 8086 must be analyzed. 
The maximum mode timing diagram specifies two possible delay paths to generate 
ALE. The first is TCHSV + TSVLH, measured from the rising edge of the clock period 
preceding Tl. The second path is TCLLH, measured from the start ofTl. 
Since the 8288 
initiates a bus cycle from the status lines leaving the passive state (SO, SI, Sf = 1,1,1), 
if the 8086 is late in issuing the status 
(TCHSV max) while the clock high time is a 
minimum 
(TCHCL min), the status will not have changed by the start of Tl, and ALE 
is issued TSVLH ns after the status changes. If the status changes prior to the beginning 
of Tl, the 8288 will not issue the ALE until TCLLH ns after the start of Tl. The result- 
ing worst case delay to enable ALE (relative to start of Tl) is TCHSV max + TSVLH 
max - 
TCHCL min = 58 ns. 
The trailing edge of ALE is triggered in the 8288 by the positive clock edge in Tl, 
regardless 
of the delay to enable ALE. The resulting 
minimum 
ALE pulse width is 
TCLCH max - 
58 ns = 75 ns assuming TCHLL = O.TCLCH max must be used, since 
TCHCL min was assumed to derive the 58 ns ALE enable delay. The address is guaran- 
teed to be valid TCLCH min + TCHLL min - TCLA V max = 8 ns prior to the trailing 
edge of ALE to capture the address in the 8288 or 8283 latches. Again we have assumed 
a very conservative 
TCHLL = O. Note that since the address and ALE are driven by 
separate devices, no tracking of A.C. characteristics 
can be assumed. 
The address hold time to the latches is guaranteed 
by the address remaining valid 
until the end of Tl, while ALE is disabled a maximum 
of 15 ns from the positive clock 
transition in Tl (TCHCL min - 
TCHLL max = 52 ns address hold time). The multi- 
plexed bus transitions 
from address to status and write data, or tristate (for read) are 
identical to minimum 
mode timing. 
Also, since the address 
valid delay (TCLA V) 
remains the critical path in establishing a valid address, the address access times to Valid 
Data and Ready are the same as the minimum 
mode system. 


The maximum 
mode system offers two read signals, generated separately by the 
8086 and the 8288. The 8086 RD output signal timing is identical to the minimum mode 
system, but the A.C. characteristics 
of the Read Control signal generated 
by the 8288 
are significantly 
better. 
Devices 
on a demultiplexed 
buffered 
system 
bus should 
therefore use the 8288 Read Control signal. The 8086 RD signal is available for devices 
that reside directly on the multiplexed 
bus. 
The following evaluation 
only considers the 8288 Read Control signal timing. 
The 8288 outputs 
separate 
Memory and I/O Read Control signals (lORC and 
MRDC); 
both have the same A.C. characteristics. 
These control signals are issued 
TCLML ns after the start of T2; they terminate 
TCLMH ns after the start of T4. The 
minimum 
control pulse length is 2TCLCL - 
TCLML max + TCLML min = 375 ns. 
The access time to valid data at the 8086 is 2TCLCL - TCLML max - TDVCL max = 
335 ns. Since the 8288 was designed for systems with buffered data busses, control sig- 
nals 10RC and MRDC are enabled before the 8086 has floated the multiplexed 
bus; 
therefore control signals 10RC and MRDC should not be used by devices that connect 
directly to the multiplexed 
bus, otherwise bus contention 
could result during 8086 bus 
float and device turn on. 
The direction control for data bus transceivers 
is established 
in TI. Transceivers 
are enabled by DEN until the positive clock transition 
of T2. This provides TCLCH + 
TCVNV min = 123 ns for 8086 bus float delay, and TCHCL min + TCVNV max - 
TDVCL max = 187 ns of transceiver active to data valid at the 8086. Since both DEN 
and control signals are valid a minimum 
of 10 ns into T4, the 8086 data hold time 
TCLDZ is guaranteed. 
A maximum 
DEN disable of 45 ns (TCVNX max) guarantees 
the transceivers 
are disabled by the start of the next 8086 bus cycle (215 ns minimum 
from the same clock edge). On a positive clock transition 
of T4, DTiR: is returned 
to 
transmit, 
in preparation 
for a possible write operation on the next bus cycle. Since the 
system memory and I/O devices reside on a buffered system bus, they must float their 
outputs before the device for the next bus cycle is selected (approximately 
2TCLCL), or 
the transceivers 
drive write data onto the bus (approximately 
2TCLCL). 


In the maximum mode, the 8288 provides normal and advanced write control sig- 
nals for memory and I/O (MWTC, AMWC, IOWC, AIOWC). The advanced write con- 
trol signals are active a full clock period ahead of the normal write control signals. The 
timing for advanced write control signals is identical to the timing for the read control 
signals. The advanced Write pulse width is 2TCLCL - TCLML max + TCLMH min = 
375 ns, while a normal write pulse width is TCLCL - 
TCLML max + TCLMH min = 
175 ns. Write data set up time to the selected device is a function of either the data valid 
delay from the 8086 (TCLDV), 
or the transceiver 
enable delay (TCVNV). The worst 
case delay to valid write data is TCLDV = 110 ns, minus transceiver propagation delays. 
This implies that data may not be valid until 
100 ns after the leading edge of the 
advanced write control signal, but will be valid approximately 
TCLCL - 
TCLDV max 
+ TCLML min = 100 ns prior to the leading edge of the normal write control signal. 
Data will be valid 2TCLCL - TCLDV max + TCLMH min = 300 ns before the trailing 
edge of either write control signal. The data and control signal overlap for the advanced 
write control is 300 ns, while the overlap with a normal write control is 175 ns. The 
transceivers 
are disabled a minimum 
of TCLCH min - 
TCLMH max + TCVNX min 
= 85 ns after write control, while the 8086 provides valid data a minimum 
of TCLCH 
min - 
TCLMH max + TCHDZ min = 85 ns. This guarantees write data hold of 85 ns 
after the write control. 
The transceivers 
are disabled 
TCLCL 
- 
TCVNX 
max + 
TCHDTL min = 155 ns (assuming TCHDTL = 0) prior to transceiver direction change 
for a subsequent 
read bus cycle. 


The maximum 
mode INTA sequence is logically identical to the minimum 
mode 
sequence. The transceiver 
control (DEN and DT/R) and INTA control timing of both 
of the interrupt acknowledge cycles are identical to the transceiver control timing of the 
read cycle. As in the minimum mode system, the multiplexed address/data 
bus will float 
from the leading edge of Tl for each INT A bus cycle and will not be driven by the 8086 
until after T4 of each INT A cycle. The setup and hold times on the vector returned 
by 
external hardware during the second INT A cycle are the same as data setup and hold for 
the read bus cycle. If the device providing the interrupt 
vector is connected to the local 
bus, TCLCL - 
TCLAZ max + TCLML min = 130 ns are available from the 8086 bus 
float to INT A command 
active. The selected device on the local bus must disable the 
system data bus transceivers, 
since DEN is still generated 
by the 8288. 


If the 8288 is not in the lOB (110 Bus) mode, 
the 8288 MCE/PDEN 
output 


becomes the MCE output. This output is active during each INT A cycle and overlaps 
the ALE signal during TI. The MCE is available for gating cascade addresses 
from a 


master 8259A onto three of the upper ADI5-AD8Iines; 
also MCE allows ALE to latch 
the cascade address into the address latches. The address lines may then be used to pro- 
vide CAS address selection to slave 8259As located on the system bus. (Refer to Figure 
8-32 for a description of this technique.) 
MCE is active within 15 ns of status or the start 


of TI for each INT A cycle. MCE should not enable the CAS lines onto the mutliplexed 
bus during the first cycle, since the 8086 does not guarantee to float the bus until 80 ns 
into the first INT A cycle. The first MCE can be inhibited by gating MCE with LOCK. 
The 8086 LOCK output is activated during T2 of the first INT A cycle; it is disabled dur- 
ing T2 of the second INT A cycle. The overlap of LOCK with MCE allows the first MCE 
to be masked and the second MCE to gate the cascade address onto the local bus. Since 
the 8259A will not provide a cascade address until the second INT A bus cycle, no infor- 
mation is lost. As with ALE, MCE is guaranteed 
valid within 58 ns of the start of TI to 
allow 75 ns CAS address 
set up of the trailing edge of ALE. MCE remains 
active 


TCHCL min - 
TCHLL max + TCLMCL min = 52 ns after ALE to provide data hold 
time to the latches. 
If the 8288 is strapped in the lOB mode, the MCE output becomes PDEN and all 


I/O references are assumed to be devices on the local bus rather than on the demulti- 
plexed system 
bus. Since INT A cycles are considered 
I/O cycles, all interrupts 
are 
assumed to come from the local system bus, and cascade addresses are not gated onto 
the system address 
bus. Additionally, 
the DEN signal is not enabled 
since no I/O 
transfers occur on the system bus. If the local I/O bus is also buffered by transceivers, 
the PDEN signal is used to enable those transceivers. 
PDEN A.C. characteristics 
are 


identical to DEN, with PDEN enabled for I/O references and DEN enabled for instruc- 
tion or data memory references. The system implications of the various modes are dis- 
cussed in a later chapter. 


Ready timing, when based on the address valid timing, is the same for maximum 
and minimum 
mode systems. The delay from 8288 control valid to RDY valid at the 


8284 is TCLCL - 
TCLML max - 
TRIVCL min = 130 ns. This time can be used by 
external 
circuits to determine 
whether 
wait state clock periods need to be inserted; 
external circuits must disable RDY to insert a wait state, or enable RDY to avoid a wait 
state. INT A, all read controls, and advanced write controls provide this timing. The nor- 
mal write control is not valid until after RDY must be valid. Since both normal and 
advanced write controls are generated by the 8288 for all write bus cycles, the advanced 
write control may be used to generate 
a RDY indication, 
even though 
the selected 
device uses the normal write control. 


RQ/GT timing is covered later in this chapter. 
The only signals to be considered 
in the maximum 
mode are the queue status 
lines QSOand QS I. These signals change on the leading edge of each clock period (high- 
to-low transition), 
including idle and wait clock periods. The queue status indicates 
Execution Unit status, independent 
of the BIU activity. External logic may sample the 
lines on the low-to-high transition of each clock pulse. When sampled, the QSOand QSI 
signals identify queue activity in the previous clock period, and therefore lag the CPU's 
activity by one clock period. 
The TEST input requirements 
are identical to those stated for minimum 
mode. 


BUS CONTROL 
TRANSFER 


I(HOLD/HLDA 
AND RQ/GT) 


The 8086 has protocol signals that are used to transfer local bus control between 
the 8086 itself and other devices capable of acting as bus masters. The minimum 
mode 
configuration 
offers a single level handshake, 
identical to 8080A and 8085 systems. The 
maximum 
mode configuration 
has an enhanced 
pulse sequence 
protocol which makes 
more efficient use of CPU pins, while extending system configurations 
to two levels of 
alternate bus masters, with two levels of priority. These protocol signals arbitrate control 
of the 8086 local bus; they should not be confused with arbitration 
on a system bus. 


The minimum 
mode 8086 system uses a hold request input (HOLD) to the CPU 
and a hold acknowledge output (HLDA) from the CPU. To gain control of the local bus, 
a device must assert HOLD to the CPU and wait for the HLDA before driving the bus. 
When the 8086 can relinquish the bus, it floats the RD, WR, INT A, and MilO control 
lines, the DEN and DT/R bus control lines, and the multiplexed 
address/data/status 
fines. The ALE signal is not floated. The CPU acknowledges 
the request for the local 
bus with HLDA; this allows the requesting device to take control of the local bus. The 
requesting device must maintain the HOLD request active until it no longer requires the 
local bus. The HOLD request 
to the 8086 directly affects the bus interface 
unit; it 
indirectly affects the Execution Unit. The Execution Unit will continue to execute from 
its internal queue until either more instructions 
are needed, 
or an operand transfer is 
~equired. This allows a small degree of overlap between CPU and auxiliary bus master 
pperations. 
When the requesting master drops the HOLD signal, the 8086 will respond 
by dropping HLDA. The 8086 will not redrive the bus and control signals; these signals 
}ViIIcontinue to float until the 8086 needs to perform a bus transfer. Since the 8086 may 


{ 


till be executing from its internal queue when HOLD drops, there may exist a period of 
ime during which no device is driving the bus. To prevent the control lines from drift- 
ng below the minimum 
VIH level during a transition 
of bus control, 22K ohm pull-up 
~esistors should be connected to the bus control lines. The timing diagram in Figure 8- 
f5 shows the bus control handshake sequence in the 8086 timing to sample HOLD, float 
~he bus, and enable/disable 
HLDA relative to the CPU clock. 


To guarantee 
valid system operation, 
the designer must ensure that the request- 
ing device does not assert control of the bus prior to the 8086 relinquishing 
control - 
also, that the device relinquishes 
control of the bus prior to the 8086 driving the bus. 


The maximum 
delay between HLDA and the 8086 floating the bus is TCHDZ max - 


TCHCL min - TCLHAV min = 10 ns. If the system cannot tolerate the 10 ns overlap, 
HLDA active from the 8086 should be delayed to the device. The minimum delay from 
HOLD inactive until the 8086 drives control signals on the local bus is THVCH min + 
3TCLCL = 635 ns; to drive a multiplexed bus, this delay is THVCH min + 3TCLCL + 
TCHCL = 701 ns. If the device does not release the local bus within the specified time, 
HOLD inactive to the 8086 should be delayed. The delay from HLDA inactive to driv- 
ing the busses is TCLCL + TCLCH min - 
TCLHAV max = 158 ns for control signals 
on the local bus, and TCLCL - 
TCLHA V max = 240 ns for the local data bus. 


The decision to respond to a HOLD REQUEST is made by the bus interface unit. 


The major factors that influence the decision are current 
bus activity, the state of the 
LOCK signal internal to the CPU (activated by the software LOCK prefix), and pending 
interrupts. 


If the LOCK is not active, no interrupt acknowledge cycle is in progress, and the 
BIU (Bus Interface Unit) is executing a T4 or TI clock period when the HOLD request is 
received, the minimum 
latency to HLDA is: 


35 ns 
65 ns 
200 ns 
10 ns 


THVCH min (Hold setup) 
TCHCL min 
TCLCL (Bus float delay) 
TCLHAV min (HLDA delay) 


@ 5 MHz 


34 ns 
200 ns 
82 ns 
200 ns 
160 ns 


(Just missed set up time) 
Delay to next sample 
TCHCL max 
TCLCL (Bus float delay) 
TCLHAV max (HLDA) 


If the BIU just initiated a bus cycle when the Hold Request was received, the worst 
case response time is: 


34 ns 
82 ns 
7"200 
N"200 
160 ns 


THVCH (Just missed) 
TCHCL max 
Bus cycle execution 
N wait states/bus 
cycle 
TCLHAV max (HLDA delay) 


Note that 200 ns for missing the Hold Request 
is included in the delay for bus 
cycle execution. 
If the operand transfer is a word transfer to an odd byte boundary, 
two 
bus cycles are executed to perform the transfer. The BIU will not acknowledge a Hold 
Request between the two bus cycles. This type of transfer would extend the above max- 
imum latency by four additional clock periods, plus N additional wait states. With no 
wait states in the bus cycle, the maximum 
would be 2.476 microseconds. 
Although the minimum 
mode 8086 does not have a hardware LOCK output, the 
software LOCK prefix may still be included in the instruction 
stream. The CPU inter- 
nally reacts to the LOCK prefix in the same manner 
that the maximum 
mode 8086 
would. Therefore 
LOCK does not allow a Hold Request to be honored until completion 
of the instruction 
following a prefix. In consequence, 
instructions 
which perform more 
than one memory 
reference, 
such as ADD (BX), CX, which adds CX to (BX) then 
stores the result in (BX), can execute without another bus master gaining control of the 
bus between memory references. 
Since the LOCK signal is active for one clock period 
more than instruction 
execution, 
the maximum 
latency to HLDA is: 


34 ns 
200 ns 
82 ns 
1M + 1)'200 
ns 


200 ns 
160 ns 


THVCH (Just missed) 
Delay to next sample 
TCHCL max 
LOCK instruction execution 
Set up HLDA IInternall 
TCLHAV max IHLDA delay) 


@ 5 MHz 
m is the number of clocks to execute 
the locked instructions 


If the 
Hold Request 
is made at the beginning 
of an interrupt 
acknowledge 
sequence, 
the maximum 
latency to HLDA is: 


34 ns 
82 ns 
2600 
ns 


160 ns 


THVCH (Just missedl 
TCHCL max 
13 clock cycles for INTA 
TCLHAV max 


A typical use of the minimum 
mode HOLD/HLDA 
signals is to exchange 
bus 
control with DMA controller devices such as the Intel 8257-5 or 8237 DMA controllers. 
Figure 8-36 functionally 
illustrates this type of configuration, 
using the 8257-5. 


The DMA controller 
resides on the upper half of the 8086's local multiplexed 
address data bus; it shares the A15-A8 demultiplexing 
address 
latch with the 8086. 


8257-5 registers 
must 
be accessed over the upper half of the bus. Therefore, 
odd 
addressed 
registers 
(AO= 1) are accessed with byte transfers 
to an odd I/O address, 


while even 
addressed 
registers 
are accessed 
via word I/O, 
with the expected 
data 
transferred 
in the upper byte. The 8086 read and write control signals must be demulti- 
piexed to provide separate 
I/O and memory controls that are compatible 
with 8257-5 
requirements. 
The AEN control from the 8257-5 must disable the 8086 control signals 
and the lower (A7-AO) and upper (A19-A16) 
address bus latches. Also, AEN must 
select the 8257-5 address strobe (ADSTB) for the A15-A8 address latch. If the data bus 
is buffered, a pull-up resistor on the DEN line will keep the buffers disabled. The DMA 
controller 
will only transfer 
bytes between 
memory 
and I/O; the DMA 
controller 
requires that the I/O devices reside on an 8-bit bus derived from the 16-bit to 8-bit bus 
multiplex circuit shown below. Address lines A7-AO are driven directly by the 8257 and 
BHE is generated 
by inverting AO. If A19-A16 are used, they must be provided by an 
additional port with either a fixed value or a value that is initialized by software; this 
additional port must be enabled onto the address bus by AEN. 


Figure 8-37 illustrates the 8257 connected 
to the system bus. 
By using a separate latch to hold the upper address from the 8257-5, while outputs 
are connected to the address bus as shown, 16-bit DMA transfers are provided. In this 
configuration, 
AEN simultaneously 
enables AO and BHE to allow word transfers. AEN 
still disables the CPU interface to the control and address busses. 
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The maximum 
mode 8086 configuration 
supports 
a significantly 
different 
bus 
control transfer protocol. 


The maximum 
mode RQ/GT sequence is used to transfer control of the local bus 
between the 8086 and alternative 
bus masters, such as DMA controllers, 
which reside 
totally on the local bus and share the complete CPU interface to the system bus. The 
complete 
CPU 
interface 
to 
the 
system 
bus 
includes 
the 
address 
latches, 
data 
transceivers, 
8288 bus controller and 8289 multimaster 
bus arbiter. If the alternate bus 
masters in the system do not reside directly on the 8086 local bus, then system bus 
arbitration 
is required, 
and local bus arbitration 
will not do. Multimaster 
system bus 
arbitration 
requires the 8289 bus arbiter; RQ/GT logic cannot be used. 
If a device with a simple HOLD/HLDA 
protocol is to gain control of a system bus 
with just one connected 
CPU, the ~ircuit in Figure 8-38 could be used. 
This circuit is, in effect, a simple bus arbiter that isolates the CPU from the system 
bus when an alternate bus master issues a Hold Request. The output of the circuit, AEN 
(Access ENable) disables the 8288 and 8284 when the 8086 indicates idle status (SO,sr, 
S2 = 1), LOCK is inactive and a Hold Request is active. With AEN inactive, the 8288 
floats the control outputs 
and disables DEN, which floats the data bus transceivers. 


AEN must also float the address latch (8282 or 8283) outputs. These actions remove the 
8086 from the system bus and allow the requesting device to drive the system bus. The 
AEN signal to the 8284 disables the READY input and forces the bus cycle initiated by 
the 8086 to wait until the 8086 regains control of the system bus. The CPU may actively 
drive its local bus during this interval. 


The requesting device will not gain control of the system bus during an 8086-initi- 
ated bus cycle, a locked instruction's 
execution or an interrupt 
acknowledge cycle. The 
LOCK signal from the 8086 is active between INT A cycles to guarantee 
that the 8086 
maintains 
control of the bus. Unlike the minimum 
mode 8086 HLDA response, 
the 
requesting 
master 
can gain control 
of the bus between 
consecutive 
bus cycles that 
transfer a word operand on an odd address boundary. 
Depending on the characteristics 
of the requesting 
device, one of the other 74LS74 outputs could be used to generate a 
HLDA to the device. This would be useful when interfacing to a device that requires 
some delay before it uses the bus. 


Upon completion 
of its system bus operations, 
the alternate 
bus master 
must 
relinquish 
control of the system bus and drop the HOLD request. 
After AEN goes 
active, the address latches and data transceivers are enabled, but if an 8086 initiated bus 
cycle is pending, the 8288 will not drive the control lines until a minimum delay of 105 
ns or a maximum 
delay of 275 ns has elapsed. If the system is normally not ready, the 
8284 AEN input may be enabled immediately, 
with READY returning to the 8086 when 
the selected device completes 
the transfer. 
If the system is normally ready, the 8284 
AEN input must be delayed long enough to provide access time equivalent to a normal 
bus cycle. The 74LS74 latches in the design provide a minimum 
of TCLCH ns for the 
alternate 
devices 
to float the system bus after releasing 
HOLD. They also provide 
2TCLCL ns address access and 2TCLCL - 
TAEVCH max ns (8288 command 
enable 
delay) control access prior to enabling 8284 READY detection. If HLDA is generated as 
shown in Figure 8-38, TCLCL ns are available for the 8086 to release the bus prior to 
issuing HLDA, while HLDA is dropped almost immediately 
upon loss of HOLD. 
The circuit configuration 
for an 8257-5 using this technique 
to interface to the 
maximum mode 8086 can be derived from Figure 8-37. The 8257-5 has its own address 
latch to buffer the address lines AI5-A8; the 8257-5 uses its AEN output to enable the 
latch onto the address bus. The maximum latency from HOLD to HLDA for this circuit 
is dependent 
on the state of the system when the HOLD is issued. For an idle system, 
the maximum latency is the propagation delay through the NAND gate and R/S flip-flop 
(TD!) + 2TCLCL + TCLCH max + the propagation delay of the 74LS74 and 74LS02 
(TD2). 
For a locked instruction 
it becomes TDI + TD2 + (M + 2) * TCLCL + 
TCLCH max where M is the number 
of clocks required 
for execution 
of the locked 
instruction. 
For the interrupt 
acknowledge 
cycle the latency is TD 1 + TD2 + 9 * 
TCLCL + TCLCH max. 


The RQ/GT 
protocol was developed 
to allow one or two other instruction 
set 
extension 
processors 
(co-processors) 
or special function processors to connect directly 
with the 8086 local bus. Each 8086 RQ/GT pin supports the full protocol for exchange of 
bus control. 


The bus control exchange sequence consists of a request from the alternate 
bus 
master to gain control of the local bus, a grant from the 8086 to indicate that the local 
bus has been relinquished, 
and a release pulse from the alternate bus master when done. 


The two RQ/GT pins (RQ/GTO and RQ/GTl) 
are prioritized, with RQ/GTO having the 
higher priority. Priorities are meaningful only when requests are received on both pins, 
before a response has been given to either. For example, if a request is received on RQ/ 
GTl, followed by a request on RQ/GTO prior to a grant on RQ/GTl, 
then RQ/GTO will 
gain priority over RQ/GTl. 
If, however, 
RQ/GTl 
had already been granted priority, a 
request on RQ/GTO must wait until a release pulse is received on RQ/GTl. 
The request/grant 
interaction 
sequence 
with a bus interface 
unit is similar to 
HOLD/HLDA. 
The 8086 continues 
to execute 
instructions 
taken from its internal 
queue until it requests a bus cycle, to fetch an instruction, 
or to process an operand. But 
if the release pulse is received before the 8086 needs the bus, it will not drive the bus 
until a bus cycle is required. 


Upon receipt of a request pulse, the 8086 floats the multiplexed address/data 
bus, 
the SO,ST, and S2 status lines, the LOCK pin and RD. This action does not disable the 
8288 control outputs nor does it disable the address latches, which continue to drive the 
address bus. The 8288 contains internal pull-up resistors on the SO,Sf, and S2 status 
lines to maintain the passive state while the 8086 outputs are floated. The passive state 
prevents the 8288 from initiating any control outputs or activating DEN to enable the 
transceivers buffering the data bus. If the device issuing the RQ does not use the 8288, it 
must disable the 8288 control outputs by disabling the 8288 AEN input. Also, address 
latches not used by the requesting device must be disabled by the requesting 
device. 


0~ 
I 
I 
I 
I 
I 
~~ 
I 
II 


G) 
The 8086 
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® 
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CD 
The other master floats S2, ST, SO from passive state on this edge 
o 
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Bus, BHE, and LOCK on this edge 
o 
The 8086 
redrives the control lines 
@ 
The 8086 
redrives the ADDRESS/STATUS/DATA 
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"F/GT Operation 
__ 


Detailed timing of the RQ/GT sequence 
is given in Figure 8-39. 


To request a transfer of bus control via the RQ/GT lines, a device must drive the 


I ne low for no more than one CPU clock period. This constitutes 
a request pulse. The 


equest pulse must be synchronized 
with the CPU clock to guarantee appropriate set-up 


nd hold times, relative to the 8086 clock edge which samples the RQ/GT lines. After 
i suing a request pulse, the device must begin sampling for a grant pulse, beginning with 


e next low-to-high clock edge. Since the 8086 can respond with a grant pulse in the 
lock period immediately 
following a request, 
the RQ/GT 
line may not return to the 


ositive level between the request and grant pulses. Therefore, 
edge trigger logic cannot 
apture a grant pulse. It is also necessary that the circuitry which generates the request 
,ulse guarantee 
that the request is removed 
in time to detect a grant from the CPU. 


fter receiving the grant pulse, the requesting device may drive the local bus. The 8086 
oes not float the address or data bus, LOCK or RD until the clock edge, which the 
equesting 
master 
uses to start looking for a grant. Therefore 
the requesting 
master 
hould wait the float delay time of the 8086 (TCLAZ address float or TCHDZ data float) 
efore driving the local bus. This precaution prevents bus contention while the request- 
i g master gains local bus access. 


To return local bus control to the 8086, the alternate bus master issues a release 
ulse on the same RQ/GT line. The 8086 may drive the SO-S2 status lines three clock 
ycles after detecting 
the release 
pulse. The 8086 may drive the address/data 
bus 
CHCL ns (clock high time) after the status lines are driven. The alternate bus master 
ust be floated off the local bus and must reenable other interface circuits, such as the 


~288 and address latches, by the time the 8086 regains control of the bus. The request- 
irg device may not issue a release pulse until at least one clock cycle after receiving the 
~rant pulse, and must not issue a new request until at least one clock cycle after a pre- 
~ious release pulse. 


The RQ to GT delay for a single RQ/GT line is similar to the HOLD to HLDA 
delay. The cases given for a minimum 
mode 8086 also apply to the maximum 
mode. In 
each case, the delay from RQ detection by the 8086 to GT detection by the requesting 
master is: (HOLD to HLDA delay) - 
(THVCH + TCHCL + TCLHAV). This gives a 
clock period maximum 
delay for an idle bus interface. In all other cases, the delay is 
equal to the minimum 
mode result minus 476 ns. If the 8086 has previously issued a 
grant on one of the RQ/GT lines, a request on the other RQ/GT line will not receive a 
grant until the first device releases the interface with a release pulse on its RQ/GT lines. 
The delay from release on one RQ/GT line to a grant on the other is typically one clock 
period as shown in Figure 8-40. 
Occasionally, the delay from a release on RQ/GTI 
to a grant on RQ/GTO will take 
two clock periods, and is a function of any pending request for transfer of control from 
the 8086 execution 
unit. The delay from request to grant when the interface is under 
control of a bus master on the other RQ/GT line is a function of the other bus master. 
The protocol embodies 
no mechanism 
whereby the 8086 can force an alternate 
bus 
master off the bus. To ensure that an errant alternate bus master does not "hang" 
the 
system, a watchdog timer should be employed. 


A circuit that translates a HOLD/HLDA 
handshake 
sequence into a RQ/GT pulse 
sequence is given in Figure 8-41. 


After receiving the grant pulse, the HLDA is enabled TCHCL (min) ns before the 
8086 disconnects 
itself from the local bus. If the requesting circuit drives the bus within 
20 ns of HLDA, it may be desirable to delay the acknowledge pulse by one clock period. 
The HLDA 
is dropped 
no later than one clock period after HOLD is disabled. 
The 
HLDA also drops at the beginning of the release pulse to provide 2TCLCL + TCLCH 
for the requesting 
master to relinquish control of the status lines, and 3TCLCL to float 
through 
remaining signals. 
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The 
Multibu5 


The Multibus is a general purpose multiprocessing 
system bus. This standard bus 
has mechanical, 
electrical, and form specifications. 
The Multibus is used in Intel iSBC 
single board microcomputer 
products. Multibus-compatible 
products are also offered by 
other manufacturers. 
Anyone designing multiprocessing 
systems should consider build- 
ing his or her systems around the Multibus for two important 
reasons: 


1. 
To save the time and costs associated with developing a new system; 


2. 
To gain compatibility 
with a wide variety 
of products 
available 
for the 
Multibus. 


When the 8086 is configured in the maximum 
mode, the 8288 bus controller and 8289 
bus arbiter provide a bus access and control interface that is fully compatible with the 
electrical and A.C. characteristics 
of the Multibus system bus. When configured in the 
minimum mode, the 8086 can operate easily on the Multibus (albeit with some external 
logic to encode appropriate 
signals), 
unless a multiprocessor 
system is desired. 
In all 


multiprocessor 
systems, the 8086 should be configured in the maximum 
mode. 
The Multibus 
provides a versatile communications 
channel that can be used to 
coordinate 
a wide variety of computing 
modules. 
Modules 
in the system are either 
masters or slaves. Masters obtain use of the bus and initiate data transfers, while slaves 
merely perform data transfers. The bus allows both 8-bit and 16-bit masters and slaves 
to be intermixed 
in the system. The bus supports 16 data lines, 20 address lines, 8 inter- 
rupt lines, plus control and bus arbitration 
lines. Other lines contain 
power busses, 
power backup, 
and power sense signals for switching 
memories 
to battery 
backup 
systems. A complete listing of pin assignments 
on the Multibus is given in Tables 9-1 
and 9-2. A functional description 
of the signals follows. 


{Component 
Side} 
{Circuit 
Sidel 
Pin 
Pin 
Mnemonic· 
Description 
Mnemonic 
Description 


1 
GND 
Signal GND 
2 
GND 
Signal GND 


'" 
3 
+5 V 
+5 Vdc 
4 
+5 V 
+5 Vdc 
~ " 
" .- 
5 
+5 
+5 Vdc 
6 
+5 
+5 Vdc 
3:0. 
o 
0. 
7 
+12 
V 
+12 
Vdc 
8 
+12 
V 
+12 
Vdc 
0. 
::> 
en 
9 
-5 
V 
-5 
Vdc 
10 
-5 
V 
-5 
Vdc 


11 
GND 
Signal GND 
12 
GND 
Signal GND 


13 
BCLKI 
Bus Clock 
14 
INITI 
Initialize 


'" 
15 
BPRNI 
Bus Priority In 
16 
BPROI 
Bus Priority Out 
g 
17 
BUSYI 
Bus Busy 
18 
BREOI 
Bus Request 


'" c: 
19 
MRDCI 
Memory Read Command 
20 
MWTCI 
Memory Write Command 


::> 
0 
21 
10RCI 
I/O Read Command 
22 
10WC/ 
I/O Write Command 
alU 


23 
XACKI 
Transfer Acknowledge 
24 
INH1' 
Inhibit 1 disable RAM 


'" '" 
25 
Reserved 
26 
INH21 
Inhibit 2 disable PROM or ROM 
- '" 
~~ 
27 
BHEN/ 
Byte High Enable 
28 
AD101 
c:." 
29 
CBRO/ 
Common Bus Request 
30 
AD11/ 
8<l: 
Address Bus 
"'." 
31 
CCLKI 
Constant Clock 
32 
AD121 


::> c: 
33 
INTAI 
Interrupt Acknowledge 
34 
AD131 
al 
'" 
:l 
35 
INT61 
36 
INT7/ 


0. 
37 
INT4 
3B 
INT5/ 
;: 
39 
INT21 
Parallel Interrupt Requests 
40 
INT3/ 
Parallel Interrupt Requests 
~.= 
41 
INTO 
42 
INT1/ 


43 
ADRE/ 
44 
ADRF/ 


45 
ADRC/ 
46 
ADRD/ 


'" 
47 
ADRAI 
48 
ADRBI 


'" 
49 
ADR81 
50 
ADR91 
" 
Address Bus 
Address Bus 
-c 
51 
ADR61 
52 
ADR7/ 
."<l: 
53 
ADR41 
54 
ADR51 


55 
ADR2/ 
56 
ADR3/ 


57 
ADRO/ 
58 
ADR1/ 


59 
DATE/ 
60 
DATF/ 


61 
DATCI 
62 
DATD/ 


63 
DATA/ 
64 
DATB/ 


~ 
65 
DAT8/ 
Data Bus 
66 
DAT9/ 


'" 
67 
DAT61 
68 
DAT7I 
Data Bus 
e 


69 
DAT41 
70 
DAT5/ 


71 
DAT21 
72 
DAT31 


73 
DATOI 
74 
DATlI 


75 
GND 
Signal GND 
76 
GND 
Signal GND 


'" 


77 
Reserved 
78 
Reserved 
~ " 
79 
-12 
V 
-12 
Vdc 
80 
-12 
V 
-12 
Vdc 
" .- 
3:0. 
81 
+5 V 
+5 Vdc 
82 
+5 V 
+5 Vdc 
o 
0. 


0. 
::> 
83 
+5 V 
+5 Vdc 
84 
+5 V 
+5 Vdc 
en 


85 
GND 
Signal GND 
86 
GND 
Signal GND 


All Mnemonics 
© Intel Corporation 
1978 


·Two notations for negative true (active low) signals are used in this book: 
a bar over the signal name. or a slash after the signal name le.g.• BUSY = BUSY/). 


(Component 
Side) 
(Circuit 
Sidel 
Pin 
Pin 
Mnemonic 
Description 
Mnemonic 
Description 


1 
GND 
Signal GND 
2 
GND 
Signal GND 
3 
VCCB 
+5 V Battery 
4 
VCCB 
+5 V Battery 


5 
Reserved 
6 
VCCPP 
+5 V Pulsed Power 
7 
VBBB 
-5 
V Battery 
8 
VBBB 
-5 
V Battery 
9 
Reserved 
10 
Reserved 
+ 


11 
VDDB 
+ 12 V Battery 
12 
VDDB 
+ 12 V Battery 
13 
PFSRI 
Power 
Fail 
Sense 
Reset 
14 
Reserved 
+ 


15 
VAAB 
- 12 V Battery 
16 
VAAB 
- 12 V Battery 
17 
PFSNI 
Power Fail Sense 
18 
ACLO 
AC Low 


19 
PFINI 
Power Fail Interrupt 
20 
MPROI 
Memory Protect 
21 
GND 
Signal GND 
22 
GND 
Signal GND 


23 
+15 
V 
+15 
V 
24 
+15 
V 
+15 
V 


25 
-15 
V 
-15 
V 
26 
-15 
V 
-15 
V 


27 
PAR11 
Parity 1 
28 
HALTI 
Bus Master HALT 
29 
PAR21 
Parity 2 
30 
WAITI 
Bus 
Master 
WAIT 


STATE 


31 
32 
ALE 
Bus Master ALE 
33 
34 
Reserved 


35 
36 
Reserved 


37 
38 
AUX RESETI 
Reset switch 
39 
40 
41 
42 
43 
'> Reserved 
44 


45 
46 


47 
48 
Reserved 
49 
50 


51 
52 


55 
56 


57 
58 


59 
60 


Notes: 


1. 
PFIN. on slave modules. if possible. should have the option of connecting 
to INTOI on P1. 


2. 
All undefined pins are reserved for future use. 


I 
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The Initialization 
signal resets the entire system to a predetermined 
state. INIT 
may be supplied by one of the bus masters or by external logic. 


The 20 address lines are used to transmit the address of the memory location or 
110 port to be accessed. ADR13 is the most significant bit, while ADRO is the least sig- 
nificant bit. 8-bit bus masters use 16 address lines (ADRO-ADRF) 
to address memory, 


and 8 address 
lines (ADRO-ADR 7) to select I/O ports. 
16-bit bus masters 
address 
memory via all 20 address lines and select I/O ports via the low-order 12 address lines 
(ADRO-ADRB). The 8088 however can use all 20 address lines even though considered 
an 8-bit bus CPU. 


The Inhibit RAM signal prevents RAM memory devices from responding to the 
address 
on the address 
bus. INHI 
allows ROM memory 
devices to override 
RAM 
devices when ROM and RAM memory are assigned the same memory space. 


The Inhibit ROM signal prevents ROM memory devices from responding 
to the 
address on the Address 
Bus. INH2 allows auxiliary ROM to override 
ROM devices 
when ROM and auxiliary ROM memory are assigned the same memory space. 
INHI and INH2 may also be used to allow memory mapped I/O devices to over- 
ride RAM and ROM devices respectively. 


BHEN is used to specify that data will be transferred 
on the high-order 8 data lines 
of the Multibus. 
This signal is used in systems 
that utilize 
16-bit memory 
or I/O 
modules. 


The 16 bidirectional 
data lines are used to exchange information 
with a memory 
location or I/O port. DATF is the most significant bit, although 
in 8-bit systems only 
lines DATO-DAT? 
are used, and DAT? becomes 
the most significant bit. DATO is 
always the least significant bit. 


The negative edge of the Bus Clock is used to synchronize 
bus contention. 
BCLK 
is asynchronous 
with the CPU clock. BCLK may be slowed, stopped, or single-stepped 
during debugging. 


The Bus Priority In signal tells a bus master 
that no higher priority device is 
requesting 
use of the system bus. BPRN is synchronized 
with BCLK. This signal is 
"daisy 
chained" 
if you use serial priority arbitration. 
When 
using parallel priority 
arbitration, 
a bus arbiter generates BPRN. 


This is a Bus Priority Out signal. Like BPRN, BPRO is "daisy-chained" 
when 
serial priority arbitration 
is used; BPRO is fed to the BPRN input of the next lower 
priority module. When using parallel priority arbitration, 
a bus arbiter must provide this 
signal. BPRO is synchronized 
with BCLK. 


The Bus Busy signal is supplied by the current 
bus master to indicate that the 
system bus is in use. BUSY is used by other devices to determine 
whether or not they 
may acquire control of the system bus. BUSY is synchronized 
with BCLK. 


The Bus Request signal is used by devices to indicate that they wish to become bus 
master. BREQ is synchronized 
with BCLK; it is not bussed on the motherboard. 


CBRQ is used by all potential bus masters to inform the current bus master that 
another master wishes to use the bus. If CBRQ is high, the current bus master knows 
that no other device is requesting 
the bus and therefore 
the present bus master is to 
retain the bus. 


A bus master that has control of the system bus generates all data transfer control 
signals. All address signals (and data signals when a write is to occur) must be stable at 
least 50 ns prior to the transfer control signal pulse and must remain valid for at least 50 
ns after the control signal pulse is removed. 
Information 
transfer protocol lines are not synchronous 
with BCLK. 


The Memory Read Control indicates that the address of a memory location has 
been placed on the address lines, and that the contents of the address location are to be 
placed on the data lines. 


The Memory Write Control indicates that the address of a memory location has 
been placed on the address lines and that data has been placed on the system data lines; 
the data is to be written into the addressed memory location. 


The I/O Read Control indicates that the address of an input port has been placed 
on the system address lines, and that the data at that input port is to be placed on the 
data lines. 


The I/O Write Control indicates that the address of an output port has been placed 
on the system address lines and that the data has been placed on the system data lines; 
the data is to be output to the addressed 
port. 


All exchanges 
involve handshaking. 
Therefore 
the selected bus slave must pro- 
vide the bus master with an acknowledge signal in response to the transfer control sig- 
nal. The Transfer 
Acknowledge signal is the required response that indicates that the 
specified operation has been completed. 


The Advanced Acknowledge signal is used by 8080A microprocessors. 
AACK is 
an advance acknowledge that allows the CPU to complete a specified operation without 
entering a Wait state. Bus slaves that provide AACK must also provide XACK. This 
requirement 
must be met since not all bus masters will respond to the AACK signal. 


These eight priority interrupt request lines are used with parallel interrupt resolu- 
tion circuitry.1NTi 
has the lowest priority, INTO the highest priority. 


INT A is used by a bus master to request that external logic place interrupt 
vector 
information 
on the Multibus data lines. 


Various regulated power supply lines are provided on the bus. Each module must 
provide both bulk decoupling and high frequency decoupling local to the resident logic 
devices. 


Reserved 
lines should not be used; they must be left available for future 
Intel 
definition. 
The Multibus is logically similar to the 8086 demultiplexed 
bus. An address is pro- 
vided on address lines ADRO through ADRI3. 
(The address line number is now a hex 
number.) 
The Multibus requires a delay from valid address of 50 ns before a Read Control 
signal can be transmitted 
to a selected device. The read control pulse must be at least 
100 ns wide, and the address must remain stable at least 50 ns after the Read Control 
signal terminates. 
If the selected device requires more than the 100 ns read signal or the 
150 ns minimum 
specified address access time, then the device may extend the read 
cycle by using the XACK (transfer acknowledge) signal. This signal is equivalent to the 
Ready signal connected 
to the 8284 RDY input. XACK is normally "not ready"; 
it is 
driven active to tell the CPU that the device is ready to receive or transmit data and to 
allow termination 
of the bus cycle. The Multibus specifies data setup and hold times 
relative to the XACK signal, rather than the Read or Write Control signal, to allow 
autonomous 
operation 
of the selected 
device in a multiple - processor 
system 
with 
mixed CPU types. 


The write bus cycle is similar to the read bus cycle. Written data must be valid a 
minimum 
of 50 ns prior to the Write Control signal and must be held valid a minimum 
of 50 ns following the Write Control signal. 


Master modules attached 
to the Multibus must not violate the minimum 
setup 
and hold times or control pulse widths. Many designs provide better than minimum 
margins when running at their maximum 
band width. Slave modules must be able to 
tolerate the minimum 
setup and hold times but may extend the access times if they 
delay return of XACK by an appropriate amount. 
The Multibus provides two basic interrupt 
handling methods. 
These are: 


1. 
A method whereby interrupt 
vectors are not transferred 
on the bus. Rather 
they are generated 
by the bus master's 
interrupt 
controller. 
The slave that 
requests the interrupt 
must be part of the same module as the bus master. If 
the interrupting 
slave is part of another 
module, 
then the slave will use the 
Multibus 
interrupt 
request lines (INTO-INT7) 
to request an interrupt; 
this 
interrupt 
will be processed by the bus master's 
interrupt 
controller. 


A method where the interrupt vector is transferred 
on the bus. When a slave 
device requests an interrupt, 
interrupt 
control logic interrupts 
the processor. 


The processor 
acknowledges 
the interrupt 
by lowering the INT A line and 
locking the system bus. This allows an interrupt 
vector to be transferred. 
Following 
the 
initial INT A cycle, interrupt 
control 
logic determines 
the 
address of the highest priority slave currently 
requesting 
an interrupt. 
This 
address 
is placed on the address 
bus. The addressed 
slave responds 
by 
transmitting 
an interrupt 
vector address back to the master. 


In addition to providing a standard asynchronous 
data transfer protocol and tim- 
ing specifications for designing master and slave modules, the Multibus provides a stan- 
dard 
protocol 
which 
multiple 
masters 
use 
to 
exchange 
bus 
control. 
To 
allow 
asynchronous 
masters 
to share the bus, the Multibus 
maintains 
its own clock signal 
independent 
of clock signals local to modules that might connect to the Multibus. The 
Multibus clock signed BCLK synchronizes 
asynchronous 
requests for bus access. This 
allows arbitration logic to resolve priorities and grant access to one master at a time. This 
bus arbitration technique allows masters operating at various speeds to compete equita- 
bly for use of the system bus. When a master gains control of the system bus, however, 
transfer speeds depend only on the capabilities of the master and its associated slave 
modules, 
not on BCLK, the Multibus clock signal. The maximum 
bus band width is 5 
MHz, taking into account minimum address setup and hold times and minimum control 
pulse width. When one considers the additional overhead of bus arbitration and typical 
memory response times, the actual transfer speeds are often on the order of 2 MHz. 


The Multibus 
allows serial or parallel priority arbitration 
among bus masters. 


Serial priority technique 
is illustrated in Figure 9- I. 
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The highest 
priority master 
has BPRN grounded. 
The priority enable 
output 
BPRO from each master is connected 
to the priority input BPRN of the next lowest 
priority master. If that master does not need the bus, it propagates its BPRN to BPRO. 
This propagates upstream priority to downstream 
masters. A master that needs the bus 
will output BPRO high; this denies priority to lower level masters. This logic daisy chains 
the priority enable from the highest priority master in the system to the lowest. Since a 
higher priority master must wait until the current 
bus master has completed 
any bus 
cycles already in progress, another signal must be included to indicate an idle or "not 
busy" status of the Multibus. 
The BUSY line, which serves this function, 
is a signal 
common to all bus masters. Each master using the Multibus clocks BUSY to determine 
its ability to gain access to the bus. 


When sampled on a high-to-Iow transition of BCLK, if BPRN is low (active) and 
BUSY is high (inactive), 
indicating an idle bus, then the current 
master must drive 
BUSY low prior to the next high-to-Iow transition of BCLK to prevent a higher priority 
master from gaining control of the bus before the current master completes its transfer. 


If a current 
master 
loses priority, 
it must 
release 
BUSY after completing 
its 
transfer and float its connection 
to the MuItibus. 


If another master wishes to request the bus, it must disable lower priority masters 
before the high-to-Iow 
transition 
of BCLK. Otherwise 
race conditions 
will occur in 
priority resolution 
circuits and lower priority masters. 
Also, since the priority disable 
from a higher priority master 
must propagate 
through 
the daisy chain to the lowest 
priority master, the total propagation delay from highest to lowest priority master must 
not exceed one BCLK clock period. This places an upper bound 
on the number 
of 
masters the serial priority arbitration 
can accommodate. 
The parallel priority arbitration is similar to serial priority arbitration, 
except that 
each master issues a bus request, BREQ, on the high-to-Iow transition of BCLK, and an 
external 
user-defined 
circuit resolves 
priorities. 
The priority resolving 
circuit 
must 
resolve and return stable priority inputs, BPRN's, back to each master within one period 
of BCLK. A parallel resolving circuit and an example of bus exchange timing are given 
in Figure 9-2. 
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The 
Multibus 
architecture 
provides 
a 
well-defined 
bus 
structure 
for 
multiprocessor 
systems. 
The bus serves 
as a means 
of sharing 
resources 
between 
multiprocessors 
and communicating 
between the processors 
in the system. There are 
two elementary 
forms of multiprocessor 
systems. They are: 


1. 
A tightly coupled system. In a tightly coupled system, 
multiple processors 
communicate 
with each other by passing information 
through 
a common 
memory space. 


2. 
A loosely coupled system. In a loosely coupled system, multiple processors 
communicate 
with each other by passing information 
via the 110 structure. 
Typically, a serial communications 
link is used. However, in some cases infor- 
mation is passed via a mass storage device such as a high-speed disk. 


The Multibus is designed to satisfy the requirements 
of tightly coupled systems, 
while still allowing multi-CPU processing systems to be loosely coupled via standard I/O 
communication 
protocols like Bisync or HDLC. The mechanisms 
for sharing the bus 
consist of a combination 
of the Multibus protocol and user-defined 
priority resolution 
circuitry. The Multibus provides the basic controls for each processor to request the bus, 
to receive an acknowledgement 
when the processor has bus priority, and to indicate bus 
availability. The user must select the priority resolution 
technique 
most applicable to 
any task. Be cautious 
when selecting a priority system, 
otherwise 
the Multibus 
may 
become 
a system 
bottleneck, 
degrading 
overall system 
performance. 
To ensure 
an 
equitable 
utilization 
of the bus, the MUL TIBUS supports 
a common 
bus request, 


CBRQ. This signal allows low priority devices to request the bus from a higher priority 
bus master or to indicate a pending lower priority request. This allows the current bus 
master to maintain control of the bus until the higher priority master forces the current 
master off the bus (via the loss of priority) or a lower priority master requests the bus 
(via common 
bus request). 
The response 
by the current 
master 
to a common 
bus 
request is user-definable. 
It may cause the master to release the bus at the end of the 
current 
transfer 
or immediately 
if there is no bus activity, or the master may simply 
ignore the request and only relinquish bus control to higher priority masters. The com- 
mon bus request gives the designer another level of control when defining the utiliza- 
tion of the system bus. In consequence 
the current master is not forced to release and 
re-request 
the bus for every transfer. This minimizes the overhead associated with bus 
access and transfer. 


The Multibus BUSY signal allows the current master to maintain bus control even 
though a higher priority master has forced it to lose priority (forced its BPRN signal 
inactive). This is necessary for operations requiring multiple bus cycles, where the cur- 
rent master's 
bus cycles cannot be separated 
by bus cycles executed 
for another 
bus 
master. If the current bus master has control of the bus, it may continue to control the 
bus by not releasing 
BUSY to indicate that the bus is available. 
This capability is 
required 
for multiple bus cycle interrupt 
acknowledge 
sequences 
and for test and set 
operations. 
It is a function of the master module to interpret those conditions for which 
it must not relinquish control of the bus. When these conditions arise, the master must 
ignore the loss of BPRN until the required operation 
is complete. 
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The Multibus is a demultiplexed 
system bus with separate address and data lines. 
The system maintains a one megabyte address space via 20 address lines, and there is a 
16-bit data bus. For compatibility 
with existing Multibus-compatible 
modules, 
which 
only maintain an 8-bit data bus, all 8-bit transfers are performed over the least signifi- 
cant half of the data bus regardless of the address. Only 16-bit transfers use the full 16 
data lines. This departs from the standard 8086 procedure of high and low byte transfers 
on the upper and lower halves of the data bus. As a result, byte transfers to existing 8-bit 
interface slave modules are independent 
of the CPU. Slave modules supporting 
16-bit 
interfaces are accessible, a byte at a time, by master modules that only interface to an 8- 
bit data bus, yet slaves can be accessed a word at a time or a byte at a time by masters 
that support the full 16-bit data bus. The Multibus signal BHEN provides the additional 
information 
required to determine 
whether a byte or a word transfer is being performed. 


Figure 9-3 shows how BHEN controls the gating of byte and word information 
to and 
from the Multibus system bus. 
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Multiprocessor Configurations 


for the 8086 


The 8086 family of components 
was devised to provide a system architecture 
that 
would allow for both single processor and multiprocessor 
configurations. 
This chapter 
wil1 describe 
how maximum 
mode is used in a multiprocessor 
system; 
we will also 
examine various configurations 
supported 
by the 8086 family of components. 
The multiprocessing 
capabilities of the 8086 family are based on these two sepa- 
rate and distinct features: 


1. 
Special function 
processors 
which reside on the local bus and enhance 
the 
basic architecture 
of the 8086 CPU. 


2. 
Multiple CPUs sharing a common system bus. 


A special function 
processor is dedicated to one CPU; it executes extensions 
to 
the CPU's instruction 
set and processes in parallel with this CPU. Special function pro- 
cessors are therefore referred to as co-processors. 
A multiple microprocessor 
system, in 
contrast, 
more 
closely resembles 
the classical multi-CPU 
environment 
where 
the 
interaction 
of CPUs 
is user 
defined. 
Note 
that 
a system 
may combine 
the 
two 
capabilities, i.e., each CPU in a multiprocessor 
system may have its own unique dedi- 
cated 
co-processors. 
Multi-CPU, 
multiprocessor 
systems' 
needs 
are 
met 
by the 
Multibus defined in Chapter 9. 


The 8086 has hardware and software characteristics 
aimed at enabling co-process- 
ing. The hardware support includes the queue status signals (QSO, QSl), a TEST input, 
and a mechanism 
for sharing the CPU's local bus (RQ/GT). 
The software support con- 
sists of a special class of instructions 
called ESCAPE instructions 
which activate the co- 
processor, 
and a WAIT instruction 
which samples the TEST input (used for software 
synchronization 
between the CPU and the co-processors). 
The co-processor 
interfaces directly to the CPU's local bus. The CPU's local bus 
includes the multiplexed address/status 
and address/data 
lines, the SO,SI, and S2 status 
lines, the QSOand QSI queue status lines, TEST, READY, RESET, one of the RQ/GT 
lines and, perhaps, the LOCK line. The need for the LOCK signal varies, depending on 
the co-processor. 
The co-processor 
is allowed to reside on a local bus for two reasons: 


1. 
The co-processor 
is able to monitor the CPU's activity. 


2. 
The co-processor 
has full access to the CPU's resources. 


The co-processor 
requests control of the local bus via one of the RQ/GT 
lines. 
After the CPU has relinquished 
bus control to the co-processor, 
the co-processor 
may 
run local bus cycles exactly as the CPU would. When the co-processor 
is done with the 
local bus, control is returned to the CPU via the same RQ/GT line that was used to gain 
control of the bus. An example of this interface is shown in Figure 10-1. 


The co-processor 
is activated via the following sequence; 
while the CPU is fetch- 
ing and executing instructions, 
the co-processor 
monitors the instruction 
stream look- 
ing for an ESCAPE instruction. 
Information 
that is transferred 
from memory into the 
CPU's instruction 
queue (as opposed to memory data used as an operand by an instruc- 
tion) is selected by decoding the status lines (SO, SI, S2) which identify an instruction 
fetch. AOand BHE are decoded to determine 
whether a single or double byte instruction 
fetch is performed. Information 
coming out of the queue is identified as the first instruc- 
tion byte, or a subsequent 
instruction 
byte, via the queue status lines (QSO, QSl). The 
queue status lines also indicate that no byte was fetched, or that the queue is empty. If 
the information 
coming out of the co-processor's 
queue 
(it tracks the CPU's 
queue) 
indicates an ESCAPE instruction, 
and the queue status identifies it as the first byte of an 
instruction's 
object code, then the co-processor 
is activated. The ESCAPE instruction 
object code is: 


I 
1101 
1XXX I 


Addressing mode byte. 
If mode = 11, then no bus cycle 


The Xs are "don't 
care" 
bits to the CPU, but they can represent 
64 possible 
instructions 
to the co-processor. 
In response to the ESCAPE instruction, 
the CPU uses 
the mod and rim fields as a normal addressing 
mode specification; 
it executes 
a bus 
cycle to read data from the specified address. The co-processor 
can capture both the 
address and the data present at the selected memory location. This mechanism 
allows a 
programmer 
to treat the ESCAPE instructions 
defined for the co-processor 
as normal 
CPU instructions, 
with the full range of memory addressing modes. It allows an address 
to be passed to the co-processor 
along with one parameter. 
If the co-processor needs to 
transfer additional data to or from memory, 
it may request control of the CPU's bus. 
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Note that all information 
used by the co-proceSSQr must reside in memory, since the co- 
processor does not have access to the CPU's registers. The CPU does not capture data 
read during 
the ESCAPE 
instruction, 
and except for the read bus cycle treats 
the 


instruction 
as a null operation. 
After execution 
of the ESCAPE instruction, 
both the 
CPU and co-processor 
are free to continue 
executing 
their specific tasks in parallel. 


While executing an instruction, 
the co-processor 
will typically hold the TEST line high 
to indicate that it is busy. If a program cannot guarantee 
the co-processor 
is not busy, 
before executing 
another 
ESCAPE instruction, 
the program 
should 
either 
read the 
status 
of the co-processor 
(if possible) 
or insert 
aWAIT 
instruction 
before 
each 
ESCAPE instruction. 
The WAIT instruction will force the CPU to wait until the co-pro- 
cessor is not busy before it executes the ESCAPE instruction. 
During the WAIT instruc- 
tion, the CPU does not require the bus, except possibly to fill the instruction queue. The 
CPU will, however, 
respond to interrupts 
(if enabled) during a WAIT cycle. Note that 
the co-processor may continue to monitor the CPU's instruction stream while executing 
an ESCAPE instruction. 
Although Intel has only announced 
the 8087 co-processor, 
the 
general purpose nature of the interface makes it applicable to a wide range of higher 
level commands 
for numerics, 
language support, 
etc. 


With the increasing complexity of microprocessor 
applications and the declining 
cost/performance 
ratio of the microprocessor, 
it is becoming 
more cost effective to 
design systems with more than one microprocessor. 
Multiprocessor 
systems are imple- 
mented by partitioning the design into functional subsystems 
that are easily identifiable 
and have a clean communications 
interface 
to other systems. 
After partitioning 
the 
design and defining both the hardware interface and the software interface, 
each sub- 
system may be designed and developed in parallel by individual teams as a means of 
reducing total end product design time. Although this approach requires a high degree 
of cooperation 
and coordination 
among all design teams, benefits such as modularity, 
extensibility, 
and ease of maintenance 
are provided. 
When partitioning a system into a multiprocessor 
distributed 
intelligence system, 
the requirements 
of the functional subsystems 
will dictate the need for loosely coupled, 
tightly coupled, or both loosely and tightly coupled processors. 
In the discussion 
that follows, loosely coupled 
processors 
communicate 
via a 
shared 
I/O 
mechanism; 
tightly 
coupled 
processors 
communicate 
through 
shared 
memory. For those processors that are loosely coupled, each CPU communicates 
with 
other CPUs through 
an I/O interface, which the CPU treats as it would any other I/O 
device attached to it. There are a wide variety of standard protocols for this type of inter- 
face, e.g., SDLC, ASYNC, GPIB. These interfaces require that the subsystem hardware 
provide an I/O interface, plus software to control the interface and interpret the message 
protocol. 


Loosely coupled 
subsystems 
typically require 
neither 
frequent 
nor high-speed 
communications; 
they may be physically distant from each other, even miles apart. 


If multiple processors are tightly coupled, a technique 
must be provided for shar- 
ing memory among the processors. The 8086 family shares memory by connecting it to 
the shared system bus. The memory and I/O devices accessible over this bus may be 
utilized by all CPUs capable of being bus master. 
Shared memory and I/O devices provide the desired interprocessor 
communica- 
tions mechanisms. 
The basic CPU interface to the multimaster 
system bus is shown in Figure 10-2. 
Figure 10-3 shows the basic interface for an 8086 system. The 8289 Bus Arbiter 
has the priority arbitration 
and protocol logic needed to transfer bus control. This logic 
was described in Chapter 9 for the Multibus. The 8288 bus controller provides system 
bus control signals for the CPU; it also takes care of the CPU's address and data bus 
interface with the system bus. The 8283s and 8287s implement 
this CPU's interface to 
the shared system bus. The 8288 and 8289 also let you direct bus transfers to shared 
resources on the system bus, or to private resources 
on the CPU's local bus. 
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In Figure 10-3, all resources, 
and hence all bus transfers, 
are directed over the 
system bus. To initiate a bus cycle the 8086 drives the status lines from the passive state 
(S2, SI, SO = 1,1,1) to one of the active states. The 8288 and 8289 and the CPU operate 
synchronously 
via the common clock, CLK, and detect a bus cycle request by monitor- 
ing the status lines. The 8288 will issue ALE to strobe the address from the local multi- 
plexed bus into the 8283s. If the 8289 does not have control of the system bus (is not 
driving BUSY low), it will issue a bus request 
(BREQ) 
and common 
bus request 
(CBRQ) to gain control. This protocol is identical to that described 
for the Multibus 
system bus. The 8289 will maintain AEN in an inactive (high) state until gaining bus 
control. This action prevents the 8283s from driving the address onto the system bus 
and prevents the 8288 from driving the bus command or enabling the data transceivers 
(via DEN). Note that the 8283s will capture the address during ALE, regardless of the 
state of its OE input. The inactive state of AEN is also used to disable the 8284 RDY 
input from the system. Disabling the RDY input forces the CPU to wait (by inserting 
wait states) 
until this interface gains control of the bus and completes the bus cycle. 


Once the 8289 has gained bus priority and the bus is not busy, it enables 
AEN and 
asserts BUSY to other bus masters. AEN immediately enables the address onto the bus 
and allows the 8288 to enable the data bus transceiver 
(8287s). The 8288 will not enable 
the command 
until 105 to 275 ns later, to allow address setup and chip select decode 
time in the system. To prevent the 8284 and CPU from detecting Ready immediately 
after AEN is enabled and terminating 
the bus cycle prematurely, 
the RDY input from 
the system should normally be inactive and be disabled until a command is issued by the 
8288. To satisfy this, the system 
device selected 
to participate 
in the transfer 
will 
typically not return RDY (XACK for Multibus system bus definition) 
until it has com- 
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completed 
the transfer. 
Since the time to complete the transfer 
is device dependent, 
each device provides 
an appropriate 
delay from selection 
of command 
to RDY 
(or 
XACK). After detecting Ready, the CPU returns the status lines to the passive state. 
This action allows the 8288 to terminate the bus cycle, which the 8288 does by dropping 
the command and disabling the transceivers. 
If a higher priority bus master tries to force 
this bus master off the bus during a bus cycle (via the loss of bus priority (BPRN), the 
8289 will maintain 
bus control by holding BUSY active (Jow) until the status (S2, SI, 
SO) returns to the passive state. If the 8289 has control of the bus, it maintains 
AEN 
active, allowing bus cycles initiated by the CPU to be performed immediately. 
The tim- 
ing and signal sequence under these conditions are identical to a single-CPU maximum 
mode system. The full complement 
of bus surrender 
conditions supported 
by the 8289 
are discussed later in this chapter. 
The previous discussion represented 
a single CPU with all memory and I/O con- 


necting to the multimaster 
system bus. If you consider how many clock cycles are availa- 
ble for bus cycles, assuming 
four clocks per bus cycle, and count the number 
of bus 
cycles executed, 
you will discover that a single 8086 CPU can utilize between 50% and 
80% of the available bus band width, depending on whether the application is compute 
bound. If two 8086s are attached to a shared bus, with all memory and I/O for both pro- 
cessors connecting to this shared bus, then the throughput 
for each CPU may degrade 
by up to 37%: 


80 - 50 
= 
37% 


80 


This degradation is a direct consequence 
of contention 
for access to the shared bus. This 
calculation assumes that each CPU could use 80% of the total available bus cycles, but 
only receives 50% of the available bus cycles, since it is sharing a single bus with the 
other CPU. As more CPUs are added to the system, more degradation 
occurs for all 
CPUs. To gain appreciable benefits from multiprocessor 
systems, there must be a high 
degree of concurrent 
processing. Therefore, 
delays due to contention 
for access to the 
shared bus must be minimized. 


As a solution to this problem, the 8086 family allows local and shared resources to 
be defined for each CPU. Local resources may include memory and/or I/O, which con- 
nect to the CPU's local or resident bus and can only be accessed by the CPU to which 
they are assigned. Shared resources, 
in contrast, consist of memory and I/O which con- 
nect directly to the multimaster 
system bus and are accessible by more than one master. 


Two primary techniques 
for partitioning resources are supported by the 8288 and 
8289: address mapping and memory versus I/O. Figures 10-4a through 
10-4c demon- 
strate the variations 
derivable from these techniques. 
The common elements 
in all of 
these examples are the inclusion of a second set of address latches and use of both 8284 
RDY inputs. The major differences 
in the configurations 
presented 
in Figures 
10-4a 
through 
1O-4c are the number 
of additional components 
required to support the local 
resources, 
and the types of local resources supported. 
Figure 1O-4a allows ROM or EPROM on the CPU's local bus, and only requires 
additional address latches and possible address decode latches. The 8289 is strapped into 
the resident bus mode (RESB to Vcc), which tells the 8289 that this CPU supports resi- 
dent memory. Strapping options on the 8288 do not differ from the configuration shown 
in Figure 10-3. During a bus cycle, the 8288 issues ALE, latching the address into both 
the 8283s and 8282s. The address space assigned to the resident memory is decoded and 
used to determine 
whether this bus cycle will use resident or system resources. 
If the 


bus cycle is to utilize the system bus, then the decode result must be high to indicate 
SYSB to the 8289, CEN to the 8288, and to disable the local bus RDY via the associated 
AEN. When the 8289 is in the RESB mode, the SYSB/RESB input to the 8289 deter- 
mines whether the 8289 will request or release the system bus. If SYSB/RESB is high, 
the 8289 will request the system bus; ifSYSB/RESB is low, the 8289 will release the bus. 
The CEN input of the 8288 prevents 
the control outputs 
from being driven to their 
active state during resident bus accesses. This input is necessary to prevent a control sig- 
nal from being issued to the system if the 8289 has control of the bus. The input does 
not cause the 8288 to float the control outputs, since the 8288 must maintain them in an 
inactive state as long as the 8289 has control of the bus. Note that the 8282s are always 
enabled (OE strapped low); this allows each bus cycle address to quickly decode into the 
appropriate 
system bus or resident bus select. 


The stable address latched in the 8282s also provides chip select and addresses to 
the local ROM or EPROM. The read signal from the 8086 (RD) is used to enable the 
selected local device to drive data onto the local multiplexed 
bus. When local resources 
are selected, the 8284 AEN2 input is enabled to allow a local RDY2 to generate Ready 
to the CPU. Since the 8288 will not drive control signals on the system bus, RDYI 
(or 
XACK) should not be received, even if the 8289 has bus control and AENI is active. If 
the local resources require buffering from the local multiplexed bus, the local chip select 
and RD should be used to enable the buffers. If the 110 address space overlaps the local 
memory address space, S2 should be latched and included in the local address decode to 
prevent an 110 address from decoding into a resident bus access. 
This technique only works for local ROM and EPROM in the memory space, since 
no 110 commands 
or write commands 
are available for the local bus. Note, however, 
that object code accesses typically utilize 50% to 70% of the bus cycles. Therefore, 
if the 
majority of the object code executed by the CPU is in local ROM or EPROM, the system 
bus utilization per CPU can be reduced to less than 30%. All RAM and 110 must reside 
on the shared system bus and therefore 
will be subject to contention. 
Figure 10-4b adds local 110 to the previous configuration 
shown in Figure 10-4a. 
This is achieved by strapping both the 8288 and 8289 to operate in the lOB (I/O Bus) 
mode. In this mode, all 110 is assumed to reside on the local bus; thus 110 does not 
require access to the system bus. But when the lOB option is used, the CPU cannot 
access 110 on the system bus unless it is memory mapped. For each 110 bus transfer, the 
8289 will not attempt to gain bus control and may release the bus if it currently has con- 
trol. The 8288 will drive the 110 control outputs, 
regardless of the state of the AEN 
input. In this mode, the 8288 110 control outputs must not be connected to the system 
bus 110 control lines. They should only be used to provide control to the local 110 
devices. No additional circuitry is required 
to support this extension 
to Figure 1O-4a. 
The AEN2 input to the 8284 is now enabled by a chip select to either the local memory 
or local 110. The 8288 CEN input must be high during 110 cycles to allow the 110 con- 
trol output to be driven low (active). If the data lines to/from 
the 110 devices must be 
buffered from the local multiplexed bus, the DT/R signal from the 8288 can be used as a 
direction control signal for the buffers. To enable the buffers, the 8288 can be used as a 
direction control signal for the buffers. To enable the buffers, the 8288 provides a sepa- 
rate peripheral data enable control (PDEN). This signal is only available when the 8288 
is strapped in the 110 Bus (lOB) mode. PDEN is enabled for 110 transfers, while DEN is 
enabled only for data transfers over the system bus. 
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The next extension to the configuration alternatives 
is shown in Figure lO-4c. For 
this case, we have added a second 8288 and data bus transceivers. 
This configuration 
is 
the most flexible since it is totally based on address mapping, and allows access to both 
shared and local ROM/EPROM, 
RAM and I/O. The lOB mode is not used on either 
8288. Since the local bus is not a multimaster 
bus, a second 8289 is not required. The 
PROM or decoder sources in the address map are for both memory and I/O references. 
The resulting SYSB/RESB signal is inverted to provide the proper enable polarity for the 
resident 
bus 8288 CEN input. The 8288 provides address latching and full data bus 
transceiver 
control for the resident data bus, and a complete set of read and write con- 
trols for memory and I/O. The multimaster 
system bus interface functions as described 
for configuration 
IO-4a. 


The configuration 
in Figure 
1O-4c allows the CPU to support 
resident 
ROMI 
EPROM for fixed program and constants, 
resident RAM for stacks and local variables, 


and resident I/O which only this CPU need access. The result is modular self-contained 
processing modules which communicate 
control and data via shared memory or shared 
I/O. This allows a high degree of concurrent 
processing within the system with a typical 
system bus usage per CPU reduced to less than 25%. 
A final extension to the concept of multiple busses per CPU is to implement a sec- 
ond multimaster 
bus, rather than a resident bus. This is useful in fault tolerant systems; 


also, 
it allows performance 
enhancements 
based 
on multiple 
interprocessor 
com- 
munication 
channels. 
In this case, a CPU wishing to access the system bus would first 
attempt to utilize the primary bus; if this is not available, the secondary bus would be 
accessed. 


In addition to the RESB and lOB modes, the 8289 has several other options that 
optimize 
multi-master 
system 
bus use. These 
additional 
capabilities 
affect the bus 
release rather than its request. With respect to bus request operations, 
one additional 
comment needs to be made. With the exception of the RESB mode, the 8289 will issue a 
bus request on the second high-to-low clock transition 
(BCLK) following the low-to- 
high CPU clock transition ofT2. The one-CPU clock delay for the RESB mode provides 
address decode time for generating a stable SYSB/RESB 
signal. 


The 8289 provides LOCK, ANYRQST, and CRQLCK inputs for additional con- 
trol over bus control release. In general, the 8289 will relinquish 
control of the bus 
either at the end of the current cycle or immediately 
(if no bus cycle is in progress) when 
it loses priority (BPRN goes high). An exception arises if the LOCK input is active. The 
LOCK allows the 8289 to maintain bus control (continue to drive BUSY low) regardless 
of bus priority. The 8289 LOCK input should be driven by the CPU LOCK output to 
guarantee 
the CPU will maintain 
bus control during execution 
of the instruction 
pre- 
ceded by the LOCK prefix and during interrupt acknowledge sequences. The LOCK sig- 
nal allows operations 
like locked exchanges to serve as primitives for basic semaphore 
control of non-reentrant 
shared resources 
(also known as critical code sections). 


ANYRQST is a strapping option which forces the 8289 to release the bus either at 
the end of the current bus cycle or immediately 
(if no bus cycle is in progress) if Com- 
mon Bus Request 
(CBRQ) is active. Assertion of ANYRQST forces the 8289 to release 
the bus to a lower priority bus master. Releasing the bus for this case requires the 8289 
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'By adding another 8289 
arbiter and connecting its AEN to the 8289 
whose AEN is presently grounding, 


the processor could have access to two multi master busses. 


to drop its bus request (BREQ returns high) and enable BPRO (Bus Priority Out). (Note 
that BREQ is used for parallel priority resolving circuits, while BPRO is provided to sup- 
port a serial priority arbitration 
scheme 
via daisy chaining 
BPRO to the next lower 
priority BPRN. Only one of the two techniques is applicable within a single system). The 
8289 will then release the bus one bus clock period later by releasing BUSY. The 8289 
must release its BREQ and BPRO to allow the lower priority 8289 to receive BPRN. For 
the normal case of a higher priority master forcing another 8289 off the bus, the higher 
priority master receives BPRN from the priority scheme and does not require the lower 
priority master to release its BPRO or BREQ. 
If ANYRQST is strapped high and CBRQ is tied low, the 8289 will release the bus 
at the completion 
of each bus cycle. This may impose a high overhead on bus transfer 
logic, but it may be useful for masters which infrequently use the system bus, or use it at 
a very low band width. 


If ANYRQST 
is not tied high, the 8289 will release the bus to a lower priority 
device on CBRQ only when the CPU local bus is idle (S2, SI, SO = 1,1,1). To override 
CBRQ and never release the bus to a lower priority master, CRQLCK (Common 
Bus 
Request Lock) is provided. Strapping this input low effectively disables the CBRQ as an 
input. Although 
the same effect can be attained 
by tying CBRQ high, the strapping 
option allows it to be a programmable 
option rather than a static function. 


Note: CBRQ is bidirectional 
- 
an 8289 will drive it low to get the bus and moni- 
tor it when it has the bus. The CRQLCK therefore allows an 8289 to ignore CBRQ while 
it has the bus yet use CBRQ to get the bus. 


Other special conditions which cause the 8289 to release the bus are: an I/O cycle 
with the 8289 strapped for I/O Bus (lOB) mode, a resident bus cycle where the 8289 is 
in the RESB mode, and whenever 
the CPU enters the halt state by executing a HALT 
instruction. 
Entering the halt state is indicated to the 8289 via the halt status on the 
status lines (S2, SI, SO = 0, I, 1). 


The 8086 Instruction Set 


Listed Alphabetically 


Instruction 
Object 
Code 
Bytes 
Clock 
Periods 


AAA 
37 
1 
4 


AAD 
D5 
2 
60 


OA 


AAM 
D4 
2 
83 


OA 


AAS 
3F 
1 
4 


ADC 
aC,data 
0001010w 
2 or 3 
4 


kk 
[fi} 


ADC 
mem/reg l,data 
100000sw 
3, 4, 5 
reg: 4 


mod 010 
rIm 
or 6 
[DISP] 
mem: 
17 + EA 


[DISP] 
kk 
[fi} 
ADC 
mem/regl,mem/reg2 
000100dw 
2,3 
or 4 
reg to reg: 3 


mod rrr rIm 
mem to reg: 9 + EA 


[DISP] 
reg to mem: 
16 + EA 


[DISP] 
ADD 
aC,data 
000001 Ow 
2 or 3 
4 


kk 
[jU 


ADD 
mem/reg,data 
100000sw 
3, 4, 5 
reg: 4 


mod 000 rIm 
or 6 
mem: 
17 + EA 


[DISP] 
[DISP] 
kk 
[jj] 


ADD 
mem/reg l,mem/reg2 
OOOOOOdw 
2,3 
or 4 
reg to reg: 3 


mod rrr rIm 
mem to reg: 9 + EA 


[DISP] 
reg to mem: 
16 + EA 


[DISP] 


AND 
aC,data 
0010010w 
2 or 3 
4 


kk 
[jj] 


AND 
mem/reg,data 
1000000w 
3,4,5 
or 
reg: 4 


mod 100 rIm 
6 
[DISP] 
mem:17 
+ EA 


[DISP] 
kk 
[fi} 


AND 
mem/reg l,mem/reg2 
001000dw 
2,3 
or 4 
reg to reg: 3 


mod rrr rIm 
mem to reg: 9 + EA 


[DISP) 
reg to mem: 
16 + EA 


[DISP] 


CALL 
addr 
9A 
5 
28 


kk 
jj 
hh 
gg 


CALL 
disp16 
E8 
3 
19 


kk 
jj 


CALL 
mem 
FF 
2,3 
or 4 
32-bit 
mem pointer: 


mod 011 
rIm 
37 + EA 


[DISP] 
[DISP] 


CALL 
mem/reg 
FF 
2,3, 
or 4 
16-bit reg pointer: 


mod 010 rIm 
16 
[DISP] 
16-bit mem pointer: 


[DISP] 
21 + EA 


, 


Instruction 
Object 
Code 
Bytes 
Clock Periods 


caw 
98 
1 
2 


CLC 
F8 
1 
2 


CLD 
FC 
1 
2 


CLI 
FA 
1 
2 


CMC 
F5 
1 
2 


CMP 
aC,data 
0011110w 
2 or 3 
4 


kk 
[li! 


CMP 
mem/reg,data 
100000sw 
3,4,5 
or 
reg: 4 
mod 111 rIm 
6 
mem: 
10 + EA 


[DISP) 
[DISP) 


kk 
Ill) 


CMP 
mem/reg 1,mem/reg2 
001110dw 
2,3 
or 4 
reg to reg: 3 


mod rrr rIm 
mem to reg: 9 + EA 
[DISP] 
reg to mem: 9 + EA 
[DISP] 
CMPS 
1010011w 
1 
22 


9 + 22/repetition' 


CWD 
99 
1 
5 
DAA 
27 
1 
4 
DAS 
2F 
1 
4 
DEC 
mem/reg 
lllllllw 
2,3 
or 4 
reg: 3 
mod 001 rIm 
[DISP) 
mem: 
15 + EA 


[DISP) 


DEC 
16-bit reg 
01001rrr 
1 
2 


DIV 
mem/reg 
1111011w 
2,3 
or 4 
8-bit reg: 


mod 110 rIm 
80 - 
90 
[DISP] 
16-bit reg: 


[DISP] 
144 - 
162 


8-bit mem: 
(86 - 
96) + EA 


16-bit mem: 
(150 - 
1681 + EA 


ESC 
mem/reg 
11011xxx 
2,3 
or 4 
mem: 8 + EA 


mod xxx rIm 
reg: 2 
[DISP] 
[DISP) 


HLT 
F4 
1 
2 


IDIV 
mem/reg 
1111011w 
2,3 
or 4 
8-bit reg: 


mod 111 rIm 
101 -112 


[DISP) 
16-bit reg: 


[DISP) 
165 - 
184 


8-bit mem: 
(107 -118) 
+ EA 
16-bit mem: 
(171 - 
1901 + EA 


IMUL 
mem/reg 
1111011w 
2,3 
or 4 
8-bit reg: 


mod 101 rIm 
80 - 
98 
[DISP] 
16-bit reg: 


[DISP) 
128-154 
8-bit mem: 
(86 - 
104) + EA 


16-bit mem: 
(134 - 
160) + EA 


IN 
ac, DX 
1110110w 
1 
8 


IN 
ac, port 
1110010w 
2 
10 


Instruction 
Object 
Code 
Bytes 
Clock 
Periods 


INC 
mem/reg 
1111111w 
2,3 
or 4 
reg: 3 


mod 000 
rim 
mem: 
15 + EA 
[DISP] 
[DISP] 


INC 
16-bit 
reg 
01000m 
1 
2 


INT 
11001100· 
1 
52 


11001101 
2 
51 


type 


INTO 
CE 
1 
interrupt: 
53 


no interrupt: 
4 


IRET 
CF 
1 
24 


JA 
disp 
77 
2 
4/No 
Branch 


JNBE 
disp 
16/Branch 


JAE 
disp 
73 
2 
4/No 
Branch 


JNB 
disp 
16/Branch 


JB 
disp 
72 
2 
4/No 
Branch 
JNAE 
disp 
B/Branch 


JBE 
disp 
76 
2 
4/No 
Branch 
JNA 
disp 
16/Branch 
JCXZ 
disp 
E3 
2 
6/No 
Branch 


disp 
1B/Branch 
JE 
disp 
74 
2 
4/No 
Branch 


JZ 
disp 
16/Branch 


JG 
disp 
7F 
2 
4/No 
Branch 


JNLE 
disp 
16/Branch 


JGE 
disp 
7D 
2 
4/No 
Branch 
JNL 
disp 
16/Branch 
JL 
disp 
7C 
2 
4/No 
Branch 
JNGE 
disp 
16/Branch 
JLE 
disp 
7E 
2 
4/No 
Branch 
JNG 
disp 
16/Branch 


JMP 
addr 
EA 
5 
15 


kk 
li 
hh 
gg 


JMP 
disp 
EB 
2 
15 


disp 


JMP 
disp16 
E9 
3 
15 


kk 
jj 


JMP 
mem 
FF 
2,3 
or 4 
mem ptr 32: 


mod 
101 rim 
24 + EA 


[DISP] 
[DISP) 


JMP 
mem/reg 
FF 
2,3 
or 4 
reg ptr 16: 


mod 100 rr/m 
11 


[DISP] 
mem ptr 16: 


[DISP] 
18 + EA 


JNE 
disp 
75 
2 
4/No 
8ranch 
JNZ 
disp 
16/Branch 
JNO 
disp 
71 
2 
4/No 
Branch 


disp 
16/Branch 


JNP 
disp 
7B 
2 
4/No 
Branch 


JPO 
disp 
16/Branch 


JNS 
disp 
79 
2 
4/No 
Branch 


disp 
16/Branch 


JO 
disp 
70 
2 
4/No 
Branch 


disp 
16/Branch 


Instruction 
Object 
Code 
Bytes 
Clock 
Periods 


JP 
disp 
7A 
2 
4/No 
Branch 


JPE 
disp 
16/Branch 


JS 
disp 
78 
2 
4/No 
Branch 


disp 
16/Branch 


LAHF 
9F 
1 
4 


LDS 
reg,mem 
C5 
2,3 
or 4 
16 + EA 
mod rrr rim 
[DISP] 
[DISP] 


LEA 
reg,mem 
8D 
2,3 
or 4 
2 + EA 
mod rrr rim 
[DISP] 
[DISP] 
LES 
reg,mem 
C4 
2,3 
or 4 
16 + EA 
mod rrr rim 
[DISP] 
[DISP] 


LOCK 
FO 
1 
2 


LODS 
1010110w 
1 
12 


9 + 13/repetition" 


LOOP 
disp 
E2 
2 
5/No Branch 


disp 
17/Branch 


LOOPE 
disp 
E1 
2 
6/No 
Branch 
LOOPZ 
disp 
18/Branch 


LOOPNE 
disp 
EO 
2 
5/No Branch 
LOOPNZ 
disp 
19/Branch 


MOV 
mem/reg 
l,mem/reg2 
100010dw 
2,3 
or 4 
reg to reg: 2 


mod rrr rim 
reg to mem: 
8 + EA 
[DISP] 
mem to reg: 9 + EA 
[DISP] 


MOV 
reg,data 
1011wrrr 
2 or 3 
4 
kk 
(jj) 


MOV 
ac.mem 
1010000w 
3 
10 


kk 
jj 


MOV 
mem,ac 
1010001w 
3 
10 
kk 
jj 


MOV 
segreg,mem/reg 
8E 
2,3 
or 4 
reg to reg: 2 


mod Orr rim 
mem to reg: 8 + EA 
[DISP] 
[DISP] 


MOV 
mem/reg,segreg 
8C 
2,3 
or 4 
reg to reg: 2 


mod Orr rim 
reg to mem: 
9 + EA 
[DISP] 
[DISP) 


MOV 
mem/reg,data 
1100011w 
3,4,5 
or 
reg/mem: 
10 + EA 


mod 000 
rim 
6 


[DISP] 
[DISP] 


kk 
[ill 


MOVS 
1010010w 
1 
18 
9 + 17Ire petition " 


Instruction 
Object 
Code 
Bytes 
Clock 
Periods 


MUL 
mem/reg 
1111011w 
2,3 
or 4 
8-bit reg: 


mod 100 rim 
70-77 
[DISP) 
16-bit reg: 


[DISP) 
118 - 
133 


8-bit mem: 
(76 - 
831 + EA 


16-bit 
mem: 


(124 - 
1391 + EA 


NEG 
mem re9 . 
1111011w 
2,3 
or 4 
reg: 3 
mod 011 
rim 
mem: 
16+EA 


[DISP] 
[DISP] 


NOP 
90 
1 
3 


NOT 
mem reg 
1111011w 
2,3 
or 4 
reg: 3 
mod 010 rim 
mem: 
16 + EA 


[DISP] 
[DISP) 


OR 
aC,data 
0000110w 
2 or 3 
4 


kk 
ljil 


OR 
mem/reg,data 
1000000w 
3,4,5 
or 
reg: 4 
mod 001 
rim 
6 
mem: 
17 + EA 


[DISP] 
[DISP) 
kk 
[li! 


OR 
mem/regl.mem/reg2 
000010dw 
3. 4. 5 or 6 
reg to reg: 
3 


mod rrr rim 
mem to reg: 9 + EA 


[DISP) 
reg to mem: 
16 + EA 
[DISP) 
kk 
[jj] 


OUT 
DX.ae 
1110111w 
1 
8 
OUT 
port.ae 
1110011w 
2 
10 


yy 


POP 
mem/reg 
8F 
2.3 
or 4 
reg: 8 
mod 000 
rim 
mem: 
17 + EA 


[DISP) 
[DISP) 


POP 
reg 
01011rrr 
1 
8 
POP 
segreg 
OOOsslll 
1 
8 


POPF 
9D 
1 
8 
PUSH 
mem/reg 
FF 
2,3 
or 4 
reg: 
11 
mod 110 rim 
mem: 
16 + EA 


[DISP] 
[DISP] 


PUSH 
reg 
01010rrr 
1 
10 
PUSH 
segreg 
000ssl10 
1 
10 
PUSHF 
9C 
1 
10 
RCL 
mem/reg,eount 
110100cw 
2. 3 or 4 
count = 1 
mod 010 
rim 
reg: 2 
[DISP) 
mem:15 
+ EA 


[DISP) 
count = [CLI 
reg: 8 + (4· 
NI 


mem: 
20 + EA + (4· 
N) 


Instruction 
Object 
Code 
Bytes 
Clock 
Periods 


RCR 
mem/reg,count 
110100cw 
2,3 
or 4 
count - 
1 
mod 011 rim 
reg: 2 
[DISP] 
mem:15 
+ EA 


[DISP] 
count = [CLI 
reg: 8 + (4 • N) 
mem: 
20 + EA + (4· 
N) 


REP 
IREPE/REPNE 
1111001z 
1 
2 


RET 
(Inter-segment) 
CB 
1 
18 


RET 
(lntra-segmentl 
C3 
1 
8 


RET 
disp 16 (Inter-segment) 
CA 
3 
17 


kk 
jj 


RET 
disp 16 (Intra-segment) 
C2 
3 
12 


kk 
jj 


RDL 
mem/reg,count 
110100cw 
2,3 
or 4 
count = 1 
mod 000 rim 
reg: 2 


[DISP] 
mem:15 + EA 


[DISP] 
count = [CLI 
reg: 8 + (4 • NI 
mem: 
20 + EA + (4· 
NI 


RDR 
mem/reg,count 
110100cw 
2,3 
or 4 
count = 
·1 
mod 001 rim 
reg: 2 


[DISP] 
mem:15 
+ EA 
[DISP] 
count = [CLl 
reg: 8 + (4 • N) 
mem: 
20 + EA + (4 • N) 


SAHF 
9E 
1 
4 


SAR 
mem/reg,count 
110100cw 
2,3 
or 4 
count = 1 
mod 111 rim 
reg: 2 


[DISP] 
mem:15 
+ EA 


[DISP] 
count = [CLl 
reg: 8 + (4· 
N) 


mem: 
20 + EA + (4 • N) 


SBB 
aC,data 
0001110w 
2 or 3 
4 


kk 
[jjl 


SBB 
mem/reg,data 
100000sw 
3,4,5 
or 
reg: 4 
mod 011 rim 
6 
mem: 
17 + EA 


[DISP] 
[DISP] 


kk 
[jj] 


SBB 
mem/reg 1,mem/reg2 
000110dw 
2,3 
or 4 
reg from reg: 3 


mod rrr rim 
mem from reg: 9 + EA 


[DISP] 
reg from mem: 
16 + EA 
[DISP] 


SCAS 
1010111w 
1 
15 


9 + 15/repetition· 


SEG 
segreg 
001ss110 
1 
2 


SHL 
mem/reg,count 
110100cw 
2,3 
or 4 
count = 1 


SAL 
mod 100 rim 
reg: 2 
[DISP] 
mem:15 
+ EA 


[DISP] 
count = [CLl 
reg: 8 + (4· 
N) 


mem: 
20 + EA + (4' 
N) 


• When preceded by REP prefix 


N = count value in CL 


Instruction 
Object 
Code 
Bytes 
Clock 
Periods 


SHR 
mem/reg,count 
110100cw 
2,3 
or 4 
count 
- 
1 


mod 
101 rIm 
reg: 
2 
(DISP] 
mem:15 
+ EA 


(DISP] 
count = [CLl 
reg: 
8 + 14 • N) 


mem: 
20 + EA + (4" 
N) 


STC 
F9 
1 
2 


STD 
FD 
1 
2 


STI 
FB 
1 
2 


STOS 
1010101w 
1 
11 


9 + 10/repetition" 


SUB 
aC,data 
0010110w 
2 or 3 
4 


kk 
Iii] 


SUB 
mem/reg,data 
100000sw 
3,4,5 
or 
reg: 
4 
mod 
101 rIm 
6 
mem: 
17 + EA 


(DISP] 
(DISP] 
kk 
(jj] 


SUB 
mem/reg 1,mem/reg2 
001010dw 
2,3 
or 4 
reg from 
reg: 
3 


mod rrr rIm 
mem from 
reg: 
9 + EA 


(DISP] 
reg from 
mem: 
16 + EA 


(DISP] 


TEST 
aC,data 
1010100w 
2 or 3 
4 


kk 
(jj] 


TEST 
mem/reg,data 
1111011w 
3,4,5 
or 
reg: 
5 
mod 000 
rIm 
6 
mem: 
11 + EA 


(DISP] 
(DISP] 
kk 
(jj) 


TEST 
reg,mem/reg 
1000010w 
2,3 
or 4 
reg with 
reg: 
3 


mod rrr rIm 
reg with 
mem: 
9 + EA 


[DISP] 
[DISP) 


WAIT 
9B 
1 
3Imin.) + 5n 


XCHG 
reg,ac 
10010rrr 
1 
3 


XCHG 
reg,mem/reg 
1000011w 
2, 3 or 4 
reg with 
reg: 4 


mod rrr rIm 
reg with 
mem: 
17 + EA 


[DISP] 
[DISP] 


XLAT 
D7 
1 
11 


XOR 
aC,data 
0011010w 
2 or 3 
4 


kk 
Ujl 


XOR 
mem/reg,data 
1000000w 
3,4,5 
or 
reg: 4 
mod 
110 rIm 
6 
mem: 
17 + EA 


[DISP] 
[DISP] 
kk 
(jj] 


XOR 
mem/reg 1,mem/reg2 
001100dw 
2,3 
or 4 
reg with 
reg: 
3 


mod rrr rIm 
mem with 
reg: 
9 + EA 
[DISP] 
reg with 
mem: 
16 + EA 


[DISP) 


·When 
preceded 
by REP prefix 


n = cloc~s per sample of the TEST input 


The 8086 
Instruction 
Set 


Object Codes in 


Ascending Numeric Sequence 


Object 
Code 
Mnemonic 


Byte * 0 
Byte * 1 
Succeeding 
Bytes 


00 
mod reg rIm 
(disp)[disp] 
ADD mem/reg,reg 
(byte) 


01 
mod reg rIm 
(disp)[disp] 
ADD mem/reg,reg 
(word) 


02 
mod reg rIm 
(disp)[disp] 
ADD reg, mem/reg 
(byte) 


03 
mod reg rIm 
(disp)[disp] 
ADD reg, mem/reg 
(word) 


04 
kk 
ADD AL,kk 


05 
kk 
li 
ADD AX, likk 


06 
PUSH ES 
07 
POP ES 


08 
mod reg rIm 
(disp)[disp] 
OR mem/reg,reg 
(byte) 


09 
mod reg rIm 
(disp](disp] 
OR mem/reg,reg 
(word) 


OA 
mod reg rIm 
(disp)[disp] 
OR reg,mem/reg 
(byte) 


08 
mod reg rIm 
(disp)[disp] 
OR reg,mem/reg 
(word) 


OC 
kk 
OR AL,kk 


OD 
kk 
li 
OR AL,likk 
OE 
PUSH CS 


OF 
Not used 


10 
mod reg rIm 
(disp)[disp] 
ADC mem/reg,reg 
(byte) 


11 
mod reg rIm 
(disp)[disp] 
ADC mem/reg,reg 
(word) 


12 
mod reg rIm 
(disp)[disp] 
ADC reg,mem/reg 
(byte) 


13 
mod reg rIm 
(disp)[disp] 
ADC reg,mem/reg 
(word) 


14 
kk 
ADC AL,kk 
15 
kk 
li 
ADC AX,likk 


16 
PUSH SS 


17 
POP SS 


18 
mod reg rIm 
[disp)[disp] 
SBB mem/reg,reg 
(byte) 
19 
mod reg rIm 
[disp)[disp) 
SBB mem/reg,reg 
(word) 
1A 
mod reg rIm 
(disp)[disp] 
SBB reg,mem/reg 
(byte) 


1B 
mod reg rIm 
[disp)[dispJ 
SBB reg,mem/reg 
(word) 


1C 
kk 
SBB AL,kk 


1D 
kk 
li 
SBB AX,likk 


1E 
PUSH DS 


1F 
POP DS 


20 
mod reg rIm 
(disp)[disp] 
AND mem/reg,reg 
(byte) 
21 
mod reg rIm 
[disp)[disp] 
AND mem/reg,reg 
(word) 
22 
mod reg rIm 
(disp)[disp] 
AND reg,mem/reg 
(byte) 
23 
mod reg rIm 
(disp)[disp] 
AND reg,mem/reg 
(word) 
24 
kk 
AND AL,kk 


25 
kk 
li 
AND AX,likk 


26 
SEG ES 


27 
DAA 


28 
mod reg rIm 
(disp)[disp] 
SUB mem/reg,reg 
(byte) 
29 
mod reg rIm 
(disp)[disp] 
SU8 mem/reg,reg 
(word) 
2A 
mod reg rIm 
(disp)[disp] 
SUB reg,mem/reg 
(byte) 
2B 
mod reg rIm 
(disp)[disp] 
SUB reg,mem/reg 
(word) 
2C 
kk 
SUB AL,kk 


2D 
kk 
li 
SUB AX,jjkk 


2E 
SEG CS 


2F 
DAS 


Object 
Code 
Mnemonic 


Byte * 0 
Byte * 1 
Succeeding 
Bytes 


30 
mod reg rim 
[disp] [disp) 
XOR mem/reg,reg 
(byte) 


31 
mod reg rim 
[disp][disp) 
XOR mem/reg,reg 
(word) 


32 
mod reg rim 
[disp] [disp) 
XOR reg,mem/reg 
(byte) 


33 
mod reg rim 
[disp] [disp] 
XOR reg,mem/reg 
(word) 


34 
kk 
XOR AL,kk 


35 
kk 
jj 
XOR AX,jjkk 


36 
SEG SS 


37 
AAA 


38 
mod reg rim 
[disp][disp) 
CMP mem/reg,reg 
(byte) 


39 
mod reg rim 
[disp][disp) 
CMP mem/reg,reg 
(word) 


3A 
mod reg rim 
[disp][disp) 
CMP reg,mem/reg 
(byte) 
3B 
mod reg rim 
[disp][disp) 
CMP reg,mem/reg 
(word) 


3C 
kk 
CMP AL,kk 


30 
kk 
jj 
CMP AX,jjkk 


3E 
SEG OS 


3F 
AAS 


40 
INC AX 


41 
INC CX 


42 
INC OX 


43 
INC BX 


44 
INC SP 


45 
INC BP 


46 
INC SI 


47 
INC 01 


48 
OEC AX 
49 
OEC CX 
4A 
OEC OX 
4B 
OEC BX 
4C 
OEC SP 


40 
OEC BP 


4E 
OEC SI 


4F 
OEC 01 


50 
PUSH AX 
51 
PUSH CX 
52 
PUSH OX 
53 
PUSH BX 


54 
PUSH SP 


55 
PUSH BP 


56 
PUSH SI 


57 
PUSH 01 


58 
POP AX 
59 
POP CX 
5A 
POP OX 
5B 
POP BX 
5C 
POP SP 


50 
POP BP 


5E 
POP SI 


5F 
POP 01 


60-6F 
Not Used 


Object 
Code 
Mnemonic 


Byte * 0 
Byte * 1 
Succeeding 
Bytes 


70 
disp 
JO disp 


71 
disp 
JNO disp 


72 
disp 
JB or JNAE or JC disp 


73 
disp 
JNB or JAE or JNC disp 


74 
disp 
JE or JZ disp 


75 
disp 
JNE or JNZ disp 


76 
disp 
JBE or JNA disp 


77 
disp 
JNBE or JA disp 


78 
disp 
JS disp 
79 
disp 
JNS disp 


7A 
disp 
JP or JPE disp 


7B 
disp 
JNP or JPO disp 


7C 
disp 
JL or JNGE disp 


7D 
disp 
JNL or JGE disp 


7E 
disp 
JLE or JNG disp 


7F 
disp 
JNLE or JG disp 


80 
mod 000 rIm 
[disp)[disp] 
kk 
ADD mem/reg.kk 


80 
mod 001 
rIm 
[disp)[disp] 
kk 
OR mem/reg.kk 


80 
mod 010 rIm 
[disp)[disp] 
kk 
ADC mem/reg.kk 


80 
mod 011 rIm 
[disp)[disp] 
kk 
SBB mem/reg.kk 


80 
mod 100 rIm 
[disp)[disp] 
kk 
AND mem/reg.kk 


80 
mod 101 rIm 
[disp)[disp] 
kk 
SUB mem/reg.kk 


BO 
mod 110 rIm 
[disp)[disp] 
kk 
XOR mem/reg. 
kk 


80 
mod 111 rIm 
[disp)[disp] 
kk 
CMP mem/reg.kk 
81 
mod 000 
rIm 
[disp)[disp] 
kkjj 
ADD mem/reg.jjkk 


81 
mod 001 
rIm 
[disp)[disp] 
kkjj 
OR mem/reg.jjkk 


81 
mod 010 
rIm 
[disp)[disp] 
kkij 
ADC mem/reg.ijkk 


81 
mod 011 rIm 
[disp)[disp] 
kkij 
SBB mem/reg.ijkk 
81 
mod 100 rIm 
[disp)[disp] 
kkij 
AND mem/reg.ijkk 
81 
mod 101 rIm 
[disp)[disp] 
kkjj 
SUB mem/reg.ijkk 
81 
mod 110 rIm 
[disp)[disp] 
kkjj 
XOR mem/reg.jjkk 


81 
mod 111 rIm 
[disp)[disp] 
kkjj 
CMP mem/reg.jjkk 


82 
mod 000 
rIm 
[disp)[disp] 
kk 
ADD mem/reg.kk 
(byte) 


82 
xx 001 xxx 
Not used 


82 
mod 010 
rIm 
[disp)[disp] 
kk 
ADC mem/reg.kk 
(byte) 


82 
mod 011 rIm 
[disp)[displ 
kk 
SBB mem/reg.kk 
(byte) 


82 
xx 100 xxx 
Not used 
82 
mod 101 rIm 
[disp)[disp] 
kk 
SUB mem/reg.kk 
(byte) 
82 
xx 110 xxx 
Not used 
82 
mod 111 rIm 
[disp)[disp] 
kk 
CMP mem/reg.kk 
(byte) 


83 
mod 000 
rIm 
[disp)[disp] 
kk 
ADD mem/reg.ijkk 
(word-sign extended) 


83 
xx 001 
xxx 
Not used 
83 
mod 010 
rIm 
[disp)[disp] 
kk 
ADC mem/reg.ijkk 
(word-sign extended) 


83 
mod 011 
rIm 
[disp)[disp] 
kk 
SBB mem/reg.jjkk (word-sign extended) 


83 
xx 100 rIm 
Not used 
83 
mod 101 rIm 
[disp)[dispJ kk 
SUB mem/reg.jjkk (word-sign extended) 
83 
xx 110 xxx 
Not used 
83 
mod 111 rIm 
[disp)[disp) kk 
CMP mem/reg.ijkk 
(word-sign extended) 


84 
mod reg rIm 
[disp)[disp) 
TEST mem/reg.reg 
(byte) 


85 
mod reg rIm 
[disp)[disp) 
TEST mem/reg.reg 
(word) 
86 
mod reg rIm 
[disp)[disp] 
XCHG reg.mem/reg 
(byte) 


87 
mod reg rIm 
[disp)[disp] 
XCHG reg.mem/reg 
(word) 


88 
mod reg rIm 
[disp)[disp] 
MOV mem/reg.reg 
(byte) 
89 
mod reg rIm 
[disp)[disp] 
MOV mem/reg.reg 
(word) 


Object 
Code 
Mnemonic 


Byte '* 0 
Byte '* 1 
Succeeding 
Bytes 


8A 
mod reg rim 
[disp](disp] 
MOV reg,mem/reg 
[byte) 


88 
mod reg rim 
[disp](disp) 
MOV reg,mem/reg 
(word) 


8C 
mod ass rim 
[disp](disp) 
MOV mem/reg,segreg 


8C 
xx 1xxxxx 
Not used 


80 
mod reg rim 
[disp](disp) 
LEA reg,addr 


8E 
mod Oss rim 
(disp](disp] 
MOV segreg, mem/reg 


8E 
xx 1 xxxxx 
Not used 


8F 
mod 000 rim 
(disp](disp] 
POP mem/reg 


8F 
xx 001 
xxx 
Not used 


8F 
xx 010 
xxx 
Not used 


8F 
xx all 
xxx 
Not used 


8F 
xx 100 xxx 
Not used 


8F 
xx 101 xxx 
Not used 


8F 
xx 110 xxx 
Not used 


8F 
xx 111 
xxx 
Not used 


Not used 


90 
NOP 


91 
XCHG AX,CX 
92 
XCHG AX,DX 


93 
XCHG AX,BX 
94 
XCHG AX,SP 


95 
XCHG AX,BP 


96 
XCHG AX,SI 


97 
XCHG AX,DI 


98 
CBW 


99 
CWO 


9A 
kk 
jj hh gg 
CALL addr 


9B 
WAIT 


9C 
PUSHF 


90 
POPF 


9E 
SAHF 


9F 
LAHF 


AO 
qq 
pp 
MOV AL,addr 


A1 
qq 
pp 
MOV AX,addr 


A2 
qq 
pp 
MOVaddr,AL 


A3 
qq 
pp 
MOVaddr,AX 


A4 
MOVS BYTE 


A5 
MOVS WORD 


A6 
CMPS BYTE 


A7 
CMPS WORD 


A8 
kk 
TEST, AL,kk 
A9 
kk 
jj 
TEST AX,jjkk 
AA 
STOS BYTE 


AB 
STOS WORD 


AC 
LODS BYTE 


AD 
LODS WORD 


AE 
SCAS BYTE 


AF 
SCAS WORD 


Object 
Code 
Mnemonic 


Byte * 0 
Byte * 1 
Succeeding 
Bytes 


BO 
kk 
MOV AL,kk 


Bl 
kk 
MOV CL,kk 


B2 
kk 
MOV DL,kk 


B3 
kk 
MOV BL,kk 


B4 
kk 
MOV AH,kk 


B5 
kk 
MOV CH,kk 


B6 
kk 
MOV DH,kk 


B7 
kk 
MOV BH,kk 


B8 
kk 
li 
MOV AX,likk 


B9 
kk 
li 
MOV CX,likk 
BA 
kk 
li 
MOV DX,jjkk 


BB 
kk 
jj 
MOV BX,jjkk 


BC 
kk 
jj 
MOV SP,jjkk 


BD 
kk 
jj 
MOV BP,jjkk 


BE 
kk 
li 
MOV Sl.likk 


BF 
kk 
jj 
MOV DI,likk 
CO 
Not used 
Cl 
Not used 
C2 
kk 
li 
RET jjkk 
C3 
RET 


C4 
mod reg rim 
[disp)[disp) 
LES reg,addr 
C5 
mod reg rim 
[disp)[disp) 
LOS reg,addr 


C6 
mod 000 rim 
[disp)[disp) 
kk 
MOV mem,kk 


C6 
xx 001 
xxx 
Not used 
C6 
xx 010 
xxx 
Not used 


C6 
xx all 
xxx 
Not used 


C6 
xx 100 xxx 
Not used 


C6 
xx 101 
xxx 
Not used 


C6 
xx 110 
xxx 
Not used 


C6 
xx 111 xxx 
Not used 


C7 
mod 000 rim 
[disp)[disp) 
kkjj 
MOV mem,jjkk 


C7 
xx 001 
xxx 
Not used 
C7 
xx 010 
xxx 
Not used 
C7 
xx all 
xxx 
Not used 
C7 
xx 100 
xxx 
Not used 
C7 
xx 101 
xxx 
Not used 


C7 
xx 110 
xxx 
Not used 
C7 
xx 111 
xxx 
Not used 
C8 
Not used 
C9 
Not used 


CA 
kk 
li 
RET jjkk 


CB 
RET 


CC 
INT 3 


CD 
type 
INT Type 
CE 
INTO 


CF 
IRET 


Object 
Code 
Mnemonic 
Byte * 0 
Byte * 1 
Succeeding 
Bytes 


DO 
mod 000 
rIm 
[disp)[disp) 
ROL mem/reg,l 
(byte) 


DO 
mod 001 
rIm 
[disp)[disp) 
ROR mem/reg,l 
(byte) 


DO 
mod 010 
rim 
[disp)[disp) 
RCL mem/reg,l 
(byte) 


DO 
mod 011 
rIm 
[disp)[disp) 
RCR mem/reg,l 
(byte) 


DO 
mod 100 rIm 
[disp)[disp) 
SAL or SHL mem/reg,l 
(byte) 


DO 
mod 101 rIm 
[disp)[disp) 
SHR mem/reg,l 
(byte) 


DO 
xx 110 xxx 
Not used 


DO 
mod 111 rIm 
[disp)[disp) 
SAR mem/reg,l 
(byte) 
Dl 
mod 000 
rIm 
[disp)[disp) 
ROL mem/reg,l 
(word) 
Dl 
mod 001 
rim 
(disp)[disp] 
ROR mem/reg,l 
(word) 


Dl 
mod 010 
rIm 
[disp)[disp) 
RCL mem/reg,l 
(word) 
Dl 
mod 011 rIm 
[disp)[disp) 
RCR mem/reg,l 
(word) 
Dl 
mod 100 rIm 
[disp)[disp) 
SAL or SHL mem/reg,l 
(word) 
Dl 
mod 101 rIm 
[disp)[disp) 
SHR mem/reg,l 
(word) 
Dl 
xx 110 xxx 
Not used 


Dl 
mod 111 rim 
[disp)[disp) 
SAR mem/reg,l 
(word) 
D2 
mod 000 rIm 
[disp)[disp) 
ROL mem/reg,CL 
(byte) 
D2 
mod 001 
rIm 
[disp)[disp) 
ROR mem/reg,CL 
(byte I 
D2 
mod 010 rIm 
[disp)[disp) 
RCL mem/reg,CL 
(byte) 
D2 
mod 011 rIm 
[disp)[disp) 
RCR mem/reg,CL 
(byte) 
D2 
mod 100 rIm 
[disp)[disp) 
SAL or SHL mem/reg,CL 
(byte) 


D2 
mod 101 rIm 
[disp)[disp) 
SHR mem/reg,CL 
(byte) 
D2 
xx 110 xxx 
Not used 


D2 
mod 111 rIm 
[disp)[disp) 
SAR mem/reg,CL 
(byte) 
D3 
mod 000 rIm 
[disp)[disp) 
ROL mem/reg,CL 
(word) 
D3 
mod 001 rIm 
[disp)[disp) 
ROR mem/reg,CL 
(word) 
D3 
mod 010 rIm 
[disp)[disp) 
RCL mem/reg,CL 
(word I 
D3 
mod 011 rIm 
[disp)[disp) 
RCR mem/reg,CL 
(word) 
D3 
mod 100 rIm 
[disp)[disp) 
SAL or SHL mem/reg,CL 
(word) 


D3 
mod 101 rIm 
[disp)[disp) 
SHR mem/reg,CL 
(word) 
D3 
xx 110 xxx 
Not used 
D3 
mod 111 rIm 
[disp)[disp] 
SAR mem/reg,CL 
(word) 
D4 
OA 
AAM 


D5 
OA 
AAD 


D6 
Not used 


D7 
XLAT 


D8 
mod xxx rim 
(disp)[disp] 
ESC mem/reg 


D9 
mod xxx rIm 
[disp)[disp] 
ESC mem/reg 


DA 
mod xxx rIm 
[disp)[disp] 
ESC mem/reg 


DB 
mod xxx rIm 
(disp)[disp] 
ESC mem/reg 


DC 
mod xxx rIm 
[disp)[disp) 
ESC mem/reg 


DD 
mod xxx rIm 
[disp)[disp) 
ESC mem/reg 


DE 
mod xxx rIm 
[disp)[disp) 
ESC mem/reg 


DF 
mod xxx rIm 
(disp)[disp] 
ESC mem/reg 


EO 
disp 
LOOPNE/LOOPNZ disp 
El 
disp 
LOOPE/LOOPZ disp 


E2 
disp 
LOOP disp 


E3 
disp 
JCXZ disp 


E4 
kk 
IN AL,kk 


E5 
kk 
IN AX,kk 
E6 
kk 
OUT kk,AL 


E7 
kk 
OUT kk,AX 


E8 
disp 
disp 
CALL disp16 


E9 
disp 
disp 
JMP disp16 


Object 
Code 
Mnemonic 


Byte * 0 
Byte * 1 
Succeeding 
Bytes 


EA 
kk 
jj hh gg 
JMP addr 
EB 
disp 
JMP disp 


EC 
IN AL,DX 


ED 
IN AX,DX 


EE 
OUT DX,AL 


EF 
OUT DX,AX 
FO 
LOCK 


Fl 
Not used 


F2 
REPNE or REPNZ 
F3 
REP or REPEor REPZ 
F4 
HLT 
F5 
CMC 
F6 
mod 000 rim 
(disp)(disp] kk 
TEST mem/reg,kk 
F6 
xx 001 xxx 
Not used 


F6 
mod 010 rim 
(disp)(disp] 
NOT mem/reg 
(byte) 


F6 
mod 011 rim 
[disp)(disp] 
NEG mem/reg 
(byte) 
F6 
mod 100 rim 
[disp)(disp] 
MUL mem/reg 
(byte) 


F6 
mod 101 rim 
(disp)(disp] 
IMUL mem/reg 
(byte) 


F6 
mod 110 rim 
(disp)(disp] 
DIV mem/reg 
(byte) 
F6 
mod 111 rim 
(disp)(displ 
IDIV mem/reg 
(byte) 
F7 
mod 000 
rim 
(disp)(disp] kkjj 
TEST mem/reg,jjkk 
F7 
xx 001 
xxx 
Not used 


F7 
mod 010 
rim 
(disp)(disp] 
NOT mem/reg 
(word) 
F7 
mod 011 
rim 
(disp)(disp] 
NEG mem/reg 
(word) 
F7 
mod 100 rim 
(disp)(disp] 
MUL mem/reg 
(word) 
F7 
mod 101 rim 
(disp)(disp] 
IMUL mem/reg 
(word) 
F7 
mod 110 rim 
(disp)(disp] 
DIV mem/reg 
(word) 


F7 
mod 111 rim 
(disp] [disp] 
IDIV mem/reg 
(word) 
F8 
CLC 


F9 
STC 


FA 
CLI 


FB 
STI 


FC 
CLD 
FD 
STD 
FE 
mod 000 rim 
(disp)(disp] 
INC mem/reg 
(byte) 
FE 
mod 001 rim 
[disp)(disp] 
DEC mem/reg 
(byte) 
FE 
xx 010 
xxx 
Not used 


FE 
xx 011 
xxx 
Not used 


FE 
xx 100 xxx 
Not used 
FE 
xx 101 xxx 
Not used 
FE 
xx 110 xxx 
Not used 


FE 
xx 111 xxx 
Not used 


FF 
mod 000 rim 
(disp)(disp] 
INC mem/reg 
(word) 
FF 
mod 001 rim 
[disp)(disp] 
DEC mem/reg 
(word) 


FF 
mod 010 
rim 
[disp)(disp] 
CALL mem/reg 
FF 
mod 011 
rim 
[disp)(disp] 
CALL mem 
FF 
mod 100 rim 
(disp)(disp) 
JMP mem/reg 
FF 
mod 101 rim 
[disp)(disp] 
JMP mem 


FF 
mod 110 rim 
(disp)(disp] 
PUSH mem 


FF 
xx 111 xxx 
Not used 


The 8086 
and 8088 
Family 


AC and DC Characteristics 


and Signal Waveforms 


• 
8086 CPU 
• 
8088 CPU 
• 
8282/8283 Octal Latch 


• 
8284 Clock Generator 


• 
8286/8287 Octal Bus Transceiver 


• 
8288 Bus Controller 


• 
8289 Bus Arbiter 


808618086-4 
8086-2 


Symbol 
Parameter 
Min. 
Mu. 
Min. 
Max. 
Unlls 
Test 
CondlUonl 


TCLCL 
CLK 
Cycle 
Period 
- 
8086 
200 
500 
'25 
500 
ns 


- 
8086·4 
250 
500 


TCLCH 
CLK 
low 
Time 
(2/3 TCLCl) 
- 
15 
Ph TClCL)-15 
ns 


TCHCl 
ClK 
High 
Time 
(113 TClCl)+ 
2 
(lf3 TClCl)+ 
2 
ns 


TCH1CH2 
Cl~ 
Rise Time 
10 
'0 
ns 
From 
1.0V 10 3.5V 


TCL2Cl1 
CLK 
Fall 
Time 
10 
10 
ns 
From 
3.5V 10 1.0V 


TDVCl 
Data In Setup 
Time 
30 
20 
ns 


TCLDX 
Data 
In Hold 
Time 
'0 
10 
ns 


TR1VCL 
ROY Setup 
Time 
inlo 
8284 (See Notes 
1, 2) 
35 
35 
ns 


TCLR1X 
ROY Hold 
Time 
into 
8284 (See Notes 
1. 2) 
0 
0 
ns 


TRYHCH 
READY 
Selup 
Time 
Into 8086 
Ph TCLCL)-15 
(lh TClCL) 
- 
15 
ns 


TCHRYX 
READY 
Hold 
Time 
Into 8086 
30 
20 
ns 


TRYlCl 
READY 
Inactive 
to CLK (See Note 
4) 
-8 
-8 
ns 


T1NVCH 
Setup 
Time ~ecognllion 
30 
'5 
ns 
(INTR, 
NMf, 
TEST) (See Note 
2) 


TGVCH 
RQ/GT Setup 
Time 
30 
15 
ns 


TCHGX 
RQ Hold 
Time 
Into 8086 
40 
30 
ns 


TIMING 
RESPONSES 
8086/8086·4 
8086·2 


Symbol 
Par.meter 
Min. 
Max. 
Min. 
Max. 
Units 
Test 
Conditions 


TCLML 
Command 
Active 
Delay 
(See Note 
1) 
'0 
35 
'0 
35 
ns 


TCLMH 
Command 
Inactive 
Delay 
(See Note 
1) 
'0 
35 
'0 
35 
ns 


TAYHSH 
READY Active 
to Status 
Passive 
(See Note 3) 
110 
65 
ns 


TCHSV 
Status 
Active 
Delay 
'0 
110 
'0 
60 
ns 


TClSH 
Status 
Inactive 
Delay 
10 
'30 
'0 
70 
ns 


TCLAV 
Address 
Valid 
Delay 
'0 
110 
'0 
60 
ns 


TCLAX 
Address 
Hold 
Time 
'0 
'0 
ns 


TCLAZ 
Address 
Float 
Delay 
TeLAX 
80 
TCLAX 
50 
ns 


TSVlH 
Status 
Valid 
to ALE 
High 
(See Nole 
1) 
15 
'5 
ns 


TSVMCH 
Slat us Valid 
to MCE 
High 
(See Note 
1) 
15 
'5 
ns 


TCLLH 
CLK 
Low 
to ALE Valid 
(See Note 
1) 
15 
'5 
ns 


TCLMCH 
CLK 
Low 
10 MCE High 
(See Note 
1) 
'5 
'5 
ns 


TCHLL 
ALE 
Inactive 
Delay 
(See Note 
1) 
'5 
'5 
ns 
CL = 20-100 pF lor 


TCLMCL 
MCE Inactive 
Delay 
(See NOle 
1) 
'5 
15 
ns 
all 8086 Outputs 
(In addition 
to 
TCLDV 
Data Valid 
Delay 
'0 
110 
10 
60 
ns 
8086 sell-lOad) 


TCHDX 
Data Hold 
Time 
'0 
10 
ns 


TCVNV 
Control 
Active 
Delay 
(See Note 
1) 
5 
45 
5 
45 
ns 


TCVNX 
Control 
Inactive 
Delay 
(See Note 
1) 
'0 
45 
'0 
45 
ns 


TAZRL 
Address 
Float 
to Read Active 
0 
0 
ns 


TCLAL 
AD Active 
Delay 
'0 
'65 
'0 
'00 
ns 


TCLAH 
AD Inactive 
Delay 
'0 
'50 
'0 
80 
ns 


TAHAV 
AD Inactive 
to Nell:t Address 
Acllve 
TCLCL-45 
TCLCL-40 
ns 


TCHDTL 
Direction 
Control 
Active 
Delay 
(See Note 
1) 
50 
50 
ns 


TCHDTH 
Direction 
Control 
Inactive 
Delay (See Note 
1) 
30 
30 
ns 


TCLGL 
rrr Active 
Delay 
0 
85 
0 
50 
ns 


TCLGH 
GT Inactive 
Delay 
0 
85 
0 
50 
ns 


TALRH 
AD Width 
2TCLCL-75 
2TCLCL-50 
ns 


NOTES: 
1. Signal 
at 8284 or 8288 shown 
for reference 
only. 


2. Setup 
requirement 
lor asynchronous 
signal 
only 
to guarantee 
recognition 
at next 
CLK. 


3. Applies 
only 
to T3 and wait 
slates. 
4 
Applies 
only 
to T2 stale 
(8 ns into 
T3) 


''''., 
.rOt, -, 7",. 
"/~ 
IS 
~,j 
~ 
111;11/"0/ 


.), 
I, 


·COMMENT: 
Stresses 
above 
those 
listed 
under 
"Absofule''Maxim 


Aalings" 
may cause 
permanent 
damage 
to the device. 
ThiS ;$ a SI(~S 


r.ting 
only and functional 
operation 
of the device 
at these 
or any C4fler 
conditions 
above 
those 
indicated 
in the 
operational 
sections 
of IhtS 


specificallon 
is not implied. 
Exposure 
to absolute 
maximum 
rating 
con· 
dillons 
for elCtended periods 
may affect 
device 
reliability. 


Ambient Temperature Under Bias. 
Storage Temperature 
. 


Voltage on Any Pin with 
Respect to Ground. 


Power Dissipation 
... 


8086: TA= O·C to 70·C, Vcc= 5V ± 10% 
8086·2/8086·4: 
TA=O·C t070·C, 
Vcc=5V 
±5'10 


Symbol 
Par.met.r 
Min. 
Mall. 
Unit, 
T•• t Condition, 
- 
V'l 
I"put Low Voltage 
-0.5 
+0.8 
V 


V'H 
Input High Voltage 
2.0 
Vcc+0.5 
V 


VOL 
Output Low Voltage 
0.45 
V 
10l= 2.0 mA 


VOH 
Output High Voltage 
2.4 
V 
10H= -400~A 


Ice 
Power Supply Current 
8086/8086·4 
340 
mA 
TA=25·C 
8086·2 
350 
mA 


III 
Input Leakage Current 
±10 
~A 
OV< V'N < Vee 


ILO 
Output Leakage Current 
±10 
~A 
0.45V" 
Vour" 
Vee 


Vel 
Clock Input Low Voltage 
-0.5 
+0.6 
V 


VCH 
Clock Input High Voltage 
3.9 
Vee+1.0 
V 


Capacitance of Input Buffer 
C,N 
(All input except 
10 
pF 
fe= 1 MHz 
ADD- AD,s, AQ/GT) 


C,O 
Capacitance of 110Buffer 
20 
pF 
Ic= 1 MHz 
(ADD- AD,s, RQIGT) 


8086: 
TA=0°Ct070°C,Vcc=5V 
±10% 


8086-2/8086-4: 
TA=O°C 
to 70°C, Vcc=5V 
±5% 


808618086-4 
8086-2 


Symbol 
Parameter 
Min. 
Mu. 
Min. 
Max, 
Units 
Test Conditions 


TClCL 
eLK 
Cycle 
Period 
- 
8086 
200 
500 
125 
500 
ns 
-- 
8086·4 
250 
500 


feLCH 
eLK 
Low Time 
(2/3 TCLCL) 
- 
15 
(213 TClCL) 
- 
15 
os 


TCHCL 
elK 
High Time 
(Y) TCLCL) 
+ 2 
('13 TCLCL) 
+ 2 
ns 


TCH1CH2 
eLK 
Rise Time 
10 
10 
ns 
From 
1.0V 10 3.5V 


TCL2CL 
1 
eLK 
Fall Time 
10 
10 
ns 
From 
3.5V to 1.QV 


TOVeL 
Data In Setup 
Time 
30 
20 
ns 


TCLDX 
Data In Hold Time 
10 
10 
ns 


TR1VCL 
ROY Setup 
Time 
into 8284 (See Notes 
1, 2) 
35 
35 
ns 


TCLR1X 
ROY Hold Time 
into 8284 (See Noles 
1, 2) 
0 
0 
ns 


TAYHCH 
READY 
Setup 
Time 
into 
8086 
(2/3 TCLCL)-15 
(213 TCLCL) 
- 15 
ns 


TCHRYX 
READY Hold Time 
into 
a086 
30 
20 
ns 


TRYLCL 
READY 
Inactive 
to eLK 
(See Nole 
3) 
-8 
-8 
ns 


THVGH 
HOLD Setup 
Time 
35 
20 
ns 


TINVGH 
INTA, 
NMI, TEST Setup 
Time 
(See Note 
2) 
30 
15 
ns 


TIMING 
RESPONSES 
808618086·4 
8088·2 


Symbol 
Parameter 
Min. 
M.x. 
Min. 
Mall:. 
Unit. 
T•• t Condition. 


TCLAV 
Address 
Valid 
Delay 
10 
110 
10 
60 
ns 


TCLAX 
Address 
Hold 
Time 
10 
10 
ns 


TCLAZ 
Address 
Float 
Delay 
TCLAX 
80 
TCLAX 
50 
ns 


TLHLL 
ALE Width 
TCLCH-20 
TCLCH-10 
ns 


TCLLH 
ALE 
Active 
Delay 
60 
50 
ns 


TCHLL 
ALE 
Inactive 
Delay 
85 
55 
ns 


TLLAX 
Address 
Hold Time 
to ALE Inactive 
TCHCL-10 
TCHCL-10 
ns 


TCLDV 
Data Valid 
Delay 
10 
110 
10 
60 
ns 
CL = 20·100 pF lor 


TCHDX 
Data Hold 
Time 
10 
10 
ns 
all 8086 Outputs 
(In addlllon 
to 


TWHDX 
Data 
Hold 
Time 
After 
WR 
TCLCH-30 
TCLCH-30 
ns 
8088 self·load) 


TCVCTV 
Control 
Active 
Delay 
1 
10 
110 
10 
70 
ns 


TCHCTV 
Control 
Active 
Delay 
2 
10 
110 
10 
60 
ns 


TCVCTX 
Control 
Inactive 
Delay 
10 
110 
10 
70 
ns 


TAZRL 
Address 
Float 
to READ 
Active 
0 
0 
ns 


TCLRL 
Fm Active 
Delay 
10 
185 
10 
100 
ns 


TCLAH 
R'O Inactive 
Del.y 
10 
160 
10 
80 
ns 


TAHAV 
~ 
Inacllve 
to Next Address 
Active 
TCLCL-45 
TCLCL 
-40 
n. 


TCLHAV 
HLOA 
Valid 
Del.y 
10 
180 
10 
100 
n. 


TRLRH 
'lllWld'h 
2TCLCL-75 
2TCLCL-50 
n. 


TWLWH 
~Wld1h 
2TCLCL-60 
2TCLCL-40 
n. 


TAVAL 
Addrell 
Valid 
to ALE 
Low 
TCLCH-80 
TCLCH-40 
n. 


NOTES: 
1. Slgn.l.t 
82a. 
shown 
for 
reference 
only. 


2. Setup 
requirement 
for lIynchronoul 
Ilgnal 
only 
to guarantee 
r.cognltlon 
at next 
ClK. 
3. Appll •• 
only 
to T2 Itat •. (8 nl 
Into T3) 


READ 
CYCLE 


(NOTE 
11 


(ViA, 
IHTA = VOH) 


INTA 
CYCLE 
OTiR 


(NOTES 
1&3) 


REi, fl=VOM 
lmt",Vod 


WRITE 
CYCLE 
r 
AD,,-ADo 


(NOTE I)I 
De. 


(RD, INTA, 


DTI'A=VOMI 


ViR 


NOTES: 
1. 
ALL 
SIGNALS 
SWITCH 
BETWEEN 
VOM AND 
VOL UNLESS 
OTHERWISE 
SPECIFIED. 


2 
ROY 
IS SAMPLED 
NEAR 
THE 
END 
OF 
T2, T3, Tw 
TO 
DETERMINE 
IF Tw 
MACHINES 
STATES 
ARE 
TO 
BE 
INSERTED. 


3. 
TWO 
INTA 
CYCLES 
RUN 
BACK· 
TO· BACK. 
THE 
8086 
LOCAL 
ADDRIDATA 
BUS 
IS 


FLOATING 
OURING 
BOTH 
INTA 
CYCLES. 
CONTROL 
SIGNALS 
SHOWN 
FOR 
SECOND 
INTA 
CYCLE. 


4. 
SIGNALS 
AT 
8284 
ARE 
SHOWN 
FOR 
REFERENCE 
ONLY. 


S. 
ALL 
TIMING 
MEASUREMENTS 
ARE 
MADE 
AT 
1.5V 
UNLESS 
OTHERWISE 
NOTED. 


I 


ALE 
(1288 
OUTPUn 


see NOTE 5 


ROY 
(1284 
INPUT) 


••• 
DY ('''''NPUT) 
{ 


I 


TCLMH---I 


TSVMCH- 
- 
.- 
d- 


\V 
INVALID 
ADDRESS 
A~ 
_ 


-TC-CA-'--j~~i- 


1. ALL 
SIGNALS 
SWITCH 
BETWEEN 
VOH AND 
VOl 
UNLESS 
OTHERWISE 


SPECIFIED 


2. 
ROY IS SAMPLED 
NEAR 
THE 
END OF '" 
T). Tw TO DETERMINE 
IF Tw 


MACHINES 
STATES 
ARE 
TO 1£ 
INSERTED. 


3. 
CASCADE 
ADDRESS 
IS VALID 
eeTWEeN 
FIRST 
AND 
SECOND 
tHTA 
CYCLE. 


4. 
TWO 
INT ••• CYCLES 
RUN 
lACK· 
YO-lACK. 
THE ION 
LOCAL 
AOORIDATA 
IUS 
1$ 


FlOAtiNG 
DURING 
10TH 
INT ••• CYCLES. 
CONTROL 
FDA 
POINTER 
ADDRESS 


IS SHOWN 
FOR 
SECOND 
INTA 
CYCLE. 


S. SIGNALS 
AT 1214 OR 1218 ARE 
SHOWN 
FDA 
REFERENCE 
ONLY, 


I. 
THE ISSUANCE 
OF THE 1211 
COMMAND 
AND 
CONTROL 
SIGNALS 
(~. 


JifRR,.urwe.~, 
~,~, 
iRTl 
AND 
DEN) LAGS THE ACTIVE 
HIGH 


IU.CEN. 


1. ALL 
TIMING 
MEASUREMENTS 
ARE 
MADE 
AT 
1.5\1 UNLESS 
OTHERWISE 
NOTED. 


I. 
STATUS 
INACTIVE 
IN STATE 
JUST 
PRIOR 
TO T•. 


C,"\ 
r\ 
I :3fL_:::::: 


I"'"" 
: 


NOTE: 


1. SETUP 
REQUIREMENTS 
FOR ASYNCHRONOUS 
SIGNALS 
ONLY 
TO GUARANTEE 
RECOGNITION 
AT NEXT 
eLK 


Asynchronous 5Igl'l.1 Recognition 


r-AnYC'MC'C,e 
·.r- 
'OClMC.<,._1 


".~~~~ 


TC~Gt<_ 
_ 
-_I 
,. 
·lGVC" 
• 
.-.----~Tclcl---_I 
I 


_ --~ 
TClCl __ 
--! 
• 
TC"G~ 
• 
-ITC,Gll- 
-I· 
IClet< 


Ro,eT 
~_ 
cO~~~~sE~~OIlr ~ 
F"lS(l 
I 
-r-- 
!co::~~:SlSOA 
--f 
-~ 
~ 
I 
AELEASE 


-_11-",", 
I 
===C 
=: ====}--E 


ADWAOo 


~~~S, 


1Hl1$' 
>---------------.-< 


,~- 
~ \ elM CYClE_; 
1_ 
lOA 2 C'CLES • -I 
,,"~JL~~ 


_ 
• 
·THYCH 
_I 
I 
",co.----r- 
'\ 
! 


~ 
r.-TCltU'VI 


01.0"'''00 
",t/,-.A, ••.S, 


~.S~~'~N >----------<=:f"'" 


BUS 
CS 


INTERFACE 
55 


UNIT 
OS 


IP 


A·BUS 


AH 
Al 


BH 
Bl 


CH 
Cl 


EXECUTION 
OH 
Ol 


UNIT 
SP 


BP 


51 


~ 
01 


8088 PIN 
DIAGRAM 


MIH 
I MAX I 


MOCE 
MOCE 


GNO 
Vcc 


A15 


A13 
A161S3 


A12 
All/54 


All 
Al8JS5 


AIO 
A19/S6 


A' 
SSO 
(HIGH) 


A' 
MNIMX 


A07 
AD 


AD' 
HOLD 
IRQ/errol 


A05 
HLDA 
t~/e-Th 


AD' 
WR 
(LOCK) 


A03 
101M 
iS2) 


A02 
oTiR 
/S1) 


AOI 
1m> 
ISO) 


ADO 
ALE 
tOSOI 


NMI 
mn 
lQS11 


INTR 
mr 


ClK 
READY 


GHD 
RESET 


Ambient 
Temperature 
Under Bias. 
.O·C to 70'C 
Storage 
Temperature. 
. ... 
- 65'C 
to 
+ 150'C 


Voltage 
on Any 
Pin with 
Respect 
to Ground 


Power 
Dissipation 


'.1'1)$,/'\'$ i'l~ 
~ 
hli~. "D 
rl 


II, 
#~ 


'COMMENT 
Stresses 
above 
those ·llsted 
under 
"A~~lule 
MaxI 


Ratmgs" 
may cause 
permanent 
damage 
to the devIce 
Thts-~s a'lt"s 


rating 
only and funcltonal 
operation 
01 the device 
at these 
or a';,. olhi, 


conditions 
above 
those 
Indicated 
In the operational 
sections 
of lh~ 


speCificatIon 
1$ nol Implied 
Exposure 
10 absolute 
maKimum 
rallng 
can· 


dllions 
lor extended 
penods 
may alleet 
deVIce reliability 


Symbol 
Parameter 
Min. 
Max. 
Units 
Test 
Conditions 


V,L 
Input 
Low Voltage 
-0.5 
+ 0.8 
V 


V,H 
Input 
High 
Voltage 
2.0 
Vee + 0.5 
V 


VOL 
Output 
Low Voltage 
0.45 
V 
IOL = 2.0 mA 


VOH 
Output 
High 
Voltage 
24 
V 
IOH = 400_A 


Ice 
Power 
Supply 
Current 
340 
mA 


III 
Input 
Leakage 
Current 
'" 10 
_A 
V1N 
;:; 
Vcc 


' 
LO 
Output 
Leakage 
Current 
'" 10 
_A 
0.45V '" Your 
'" Vee 
- 
- 
-- - - ----- 
- 


veL 
Clock 
Input 
Low Voltage 
-0.5 
+ 0.6 
V 
- 


VeH 
Clock 
Input 
High 
Voltage 
39 
Vcc+1.0 
V 


Capacitance 
of Input 
Buffer 


C,N 
(All input 
except 
10 
pF 
fc = 1 MHz 


ADo-AD, 
RQ/GT) 


C,O 
Capacitance 
of I/O Buffer 
20 
pF 
Ic = 1 MHz 
(ADo-AD, 
RQ/GT) 
------ 
----- 
- 


- 


Symbol 
Parlmlt.r 
Min. 
MIx. 
Unlll 
Test Condition. 


TCLCL 
eLK Cycle 
Period 
200 
500 
ns 


leLCH 
eLK 
low 
Time 
(2hTCLCl)-15 
ns 


lCHCL 
eLK 
High Time 
('hTClCL)+ 
2 
ns 


TCH1CH2 
eLK 
Rise Time 
10 
ns 
From 
10V 103 5V 


TCl2Cll 
eLK 
Fall Time 
10 
ns 
From35Vlol.0V 


TOveL 
Data In Setup Time 
30 
ns 


TelDX 
Dala In Hold Time 
10 
ns 


TR1VGl 
ROY Setup 
Time 
Into 8284 (See Notes 
1. 2) 
35 
ns 


TCLR1X 
ROY HOld Time 
InlO 8284 (See Noles 
1, 21 
0 
ns 


TRYHCH 
AEADY 
Setup 
Time 
In!o 8088 
(lI3TClCll-15 
ns 


TCHRVX 
READY Hold rime 
Into 8088 
30 
ns 


TAYLCL 
READY 
Inacllve 
10 elK ISee 
Note 
3) 
-8 
ns 


THVCH 
HOLD 
Setup 
Time 
• 
35 
ns 


T1NVGH 
INTFI. NMI. TEST Setup rime 
(See Note 2) 
30 
"' 


TCLAZ 


TLHLl 


lClLH 


lCHll 


lClRL 
---- 


lClRH 


TRHAV 


TCLHAV 


TRlRH 


TWlWH 


lAVAL 


Cl = 2Q.H>O pF 
lor 
all 8088 Outputs 
In addlhon 
to 
internal 
loads 


Control 
Aclive 
Delay 1 
._-_.__ .. _ ..__ ..._._._. 


Control 
Active 
Delay 2 


Control 
Inactive 
Delay 


Address 
Float to READ Active 
------------ 


AD Active 
Delay 


AD Inactive 
Delay 


AD Inactive to Nelli Address 
Aclive 


10 


10 


TCLCL-45 


10 


2TCLCL-75 


2TCLCL-60 


TCLCH-60 


AD Width 


WR Width 


Address 
Valid 
to ALE Lo .•••• 


NOTES: 
1 
Signal 
at 8284 shown 
lor reference 
only. 


2 
Selup 
requirement 
lor asynchronous 
signal 
only to guaranlee 
recognition 
at nellt CLK. 


3 
Applies 
only 
10 T2 stale 
(8 ns i!'llo f3 stale). 


READ 
CYCLE 


(NOTE 
1) 


(WJt. flilTl.VOH) 


IAD,-AD, 


WRITE 
~~~~~ I 
De. 


W. 


INTA 
CYCLE 


NOTES 
1,3 


(JiD. W'R=lJoHl 


SQFTWAAEHALT-(DEN.. 
~ 


VOl;RD.WR,iiii'i 
DT/A:z 
YOH: 
"07 
_ AOo 


TelAY 
----~---------- 


NOTES: 
1. 
All 
SIGNALS 
SWITCH 
BETWEEN 
IIOH 
AND 
VOl 
UNLESS 
OTHERWISE 
SPECIFIED. 


2. 
ROY 
1$ SAMPLEO 
NEAR 
THE 
END 
OF 12. 13. Tw 
TO 
OETERMINE 
IF 
Tw 


MACHINES 
STATES 
ARE 
TO 
BE 
INSERTED. 


3. 
TWO 
INTA 
CYCLES 
RUN 
SACK·TO·BACK. 
THE soaa 
LOCAL 
AODRJDATA 


8US 
IS 
FLOATING 
DURING 
80TH 
INT'" 
CYCLES. 
CONTROL 
SIGNALS 


ARE 
SHOWN 
FOA 
THE 
SECOND 
INTA 
CYCLE. 


4. 
SIGNALS 
AT 
'284 
ARE 
SHOWN 
FOR 
REFERENCE 
ONLY. 


5. 
ALL 
TIMINO 
MEASUREMENTS 
ARE 
MADE 
AT 
1.511 UNLESS 
OTHERWISE 


NOTED. 


Symbol 
P.r.meter 
Mln_ 
M.a, 
Units 
Tut 
Condll(OIJs 


TCLCL 
CLK Cycle 
Period 
200 
500 
os 
",/. 
---- 


TCLCH 
CLK Low Time 
(2hTCLCL)-15 
os 


TCHCL 
CLK High Time 
(VJTCLCL)+ 
2 
os 


TCH1CH2 
CLK RIse Time 
10 
ns 
From 
1.0V 10 3 5V 


TCL2CL1 
ClK 
Fall Time 
10 
ns 
From35Vl010V 


TDVCL 
Data In Setup 
Time 
30 
ns 


TCLoX 
Data In Hold Time 
10 
ns 


TR1VCL 
ROY Setup 
Time Into 8284 (See Notes 
1. 2) 
35 
ns 


TCLR1X 
ROY Hold 
Time mto 8284 (See Notes 
1, 2) 
0 
ns 


TRYHCH 
READY Setup 
Time mto 8088 
(lhTCLCL)-15 
ns 


TCHRYX 
READY Hold T,me mto 8088 
30 
ns 


TRYlCL 
READY Inactive 
to CLK (See Note 4) 
-8 
ns 


TINVCH 
Setup Time for Recognition 
(INTR. NMI. TESn (See Nole 
Z) 
30 
ns 


TGVCH 
RQIGT Setup 
Time 
30 
ns 


TCHGX 
AQ Hold Time Into 8086 
40 
ns 
.- 
-- 
- 
-- - 


TIMING 
RESPONSES 
-------- 
----~------~---- 


Symbol 
P.r.meter 
Min, 
M.a. 
Unils 
Tul 
Conditions 
- ---------- 


TCLML 
Command 
Active 
Delay (See Note 
1) 
10 
35 
--_.--------- 
------- 


TCLMH 
Command 
Inactive 
Delay (See Note 
1) 
10 
35 
------- 
- -------- 
.- ------ 


TRYHSH 
READY Active 
to Status 
Passive (See Note 3) 
110 
os 
-------------- 


TCHSV 
Stalus 
Acllve 
Delay 
10 
110 
os 
------- 
-- 
-------- 


TCLSH 
Status 
Inactive 
Delay 
10 
130 
---- 
---------- 


TCLAV 
Address 
Valid Delay 
15 
110 
-- 
._-------- 


TCLAX 
Address 
Hold TIme 
10 
----- 
- --- ---------- 


TCLAZ 
Address 
Floal 
Delay 
TCLAX 
80 
---------- 
- - 


TSVLH 
Stalus 
Valid 
10 ALE HIgh (See Nole 
1) 
15 
ns 


TSVMCH 
Stalus 
Valid to MCE High (See Note 
1) 
15 
------- 


TCLLH 
CLK Low 10 ALE Valid iSee Note 
1) 
15 


TCLMCH 
CLK Low 10 MCE High (See Nole 
1) 
15 
----- 


TCHLL 
ALE Inactive 
Delay (See Nole 
1) 
15 
CL = 20-100 pF lor 


TCLMCL 
MCE Inacllve 
Delay (See Note 1) 
'5 


all 8088 Outputs 
------ 
In addlllon 
to 


TCLDV 
Data Valid Delay 
'5 
110 
Inlernal 
loads 


TCHDX 
Data Hold Time 
10 


TCVNV 
Conlrol 
ActIve 
Delay (See Nole 
1) 
5 
45 
ns 
----- 


TCVNX 
Conlrollnacllve 
Delay (See Note 
1) 
10 
45 


TAZRL 
Address 
Float to Read Acllve 
---- 


TCLRL 
RD Active 
Delay 
10 
'65 
----- 
-~--- 


TCLRH 
RD Inactive 
Delay 
10 
'50 
ns 


TAHAV 
AD Inactive 
10 Next Address 
Active 
TCLCL-45 


TCHDTL 
Dlfecllon 
Control 
Active 
Delay (See Note 
1) 
50 
ns 
----- 


TCHoTH 
Dtrecllon 
Conlrollnactive 
Delay (See Note 1) 
30 
----- 
------- 


TCLGL 
GT Active 
Delay 
110 
---- 
GT Inactive 
Delay 
TCLGH 
85 
ns 
---- 


ADWidlh 
TRLRH 
2TCLCL-75 
----- 
----- 


NOTES: 
1 
Signal 
al 8284 or 8288 shown 
lor reference 
only 


2. 
Setup 
reQulfement 
lor asynchronous 
signal 
only to guarantee 
recognition 
al next CLK 


3. 
Applies 
only to T3 and wait states. 


4 
Applies 
only to T2 state 18 ns into T3 state). 


IAlElm. 
OUTPUn 


SEE NOTE 5\ 


ROY 
18284 
INPUT) 


Ollllt 


1211 
OUTPUTS Iiii'Jrne OR 
~ 


SEE NOTES 5,1 


DEN 


~----- 
TCVNX-I 
~- 
IOFTWAIlE 
"ALT_"'"'Y·"7~~:7~~'··Y·~"YA""""'s 


~ 
jr---------r, ------- 
\~-~ 
\_----- 


NOTES: 
1. ALL 
SIGNALS 
SWITCH 
'ETWEEN 
VOH AND 
VO~ UNLESS 
OTHERWISE 
SPECIFIED. 


2. 
IlOY 
IS SAMPLED 
NE ••••• THE 
END 
OF 
12. 
T,. 
TW TO 
DfTEAMINE 
IF Tw 
MACHINES 
:iTATES 
AilE 
TO 8£ 
INSERTED. 


3. CASCAOE 
ADDRESS 
IS 
VAUO 
BETWEEN 
FIAST 
AND 
SECOND 
INTI. 
CYCLES. 
. 


4. 
TWO 
INTA 
CYCLES 
RUN 
•••• CK·TO·.ACK. 
THE 
Iota 
LOCAL 
AOOllloATA 
8US 
IS 
FLOATING 
DUJUNG 
80TH 
INTA 
CYCLES. 
COHTROL 
FDA 
POINTEIl 
ADDRESS 
IS SHOWN 
FOR SECOND 
INTA 
CYCLE. 


S. SK)NAlS 
AT 12 •• 
OR 12. ARE 
SHOWN 
FOR 
AEfERENCE 
ONLY . 


•• 
THE 
ISSUANCE 
OF 
THE 
12M 
COMMAND 
AND 
CONTROL 
SIGNALS 
~.IiIWTC. 
oOiIWC,mRe, mwc.ImWe,1llTJ. 
AND DEN! LAOS THE 
ACTIVE 
HIOH 
1281 
CEN. 


t. All 
TIMING 
MEASUIIlEMENTS 
ARE 
MADE 
AT 1.5'11UNLESS 
OTHERWise 
NOTED . 


•. 
STATUS 
INACTIVE 
IN STATE 
JUST 
PRIOR 
TO T. 


"'~ 
f'1Jt1~ 


,., 


I··-:=;{ 


-- 
1.I\IVC"'I ••••• 
OII'1 


IH11l 


TEST 


HOTE 


1 SETUP REOUIREMENTS 
FOR ASVNCMRO'-OUS 
SIGNALS 
ONl' 
TO CiUAR.••NHE 
RECOGN,flO," 
.••1 Hel' 
ell( 


Asynchronous Signal Recognition 
0~--~ 
C~II 


- 
TCL"vI4_- 
-_hCL,I,VI4_ 


""'~ 


Bus Lock Signa' Timing (Maximum Mode Only) 


.•.•.• 
,cu<c"',* -- 
OCl"C,.., 


"'~ 
~~ 
~ 


TeLa" 
!elel 
~ 


TGye", 


~" 


---- 
- 
fCHel 


~ 
~ 


RO,OT 
COPI'IO'=-fSSOR 
,COPROCUSOII 
"' 


10nOT 
ItElUSf 


"' •••ou.,"'" 
: e 


.'''''~~'~'': 
, 
:- 
}-E 
"0,-1000 
"" 
COPIIOCESSOIl 


I"J.I'i.Sii 
"'.""" 


(SU 
NOTE " 


Non. 
1 ~~~;~":~H~~:I:~~~~~~~I:~l~~: 
IU$SU OUTSIDETHEIIEOIOH 


Request/Grant Sequenca Timing (Maximum Mode Only) 


__ 
~'ClIIC'Cll_· ~-A'AfL 
"'~J - 


THyeH 
tSfEHOU.'I 
- 
1MYC" 


""O~ 
"\ 
I 


I 
I 


~ 
TelHA' 
- 
I-TelHA' 


HLO'" 
J 
\J 


.- 
TCLAl 
"..~:m" k 
"" 
IX 


Hold/Hold Acknowledge Timing (Minimum Mode Only) 


PIN DEFINITIONS 


Pin 


STROBE 
(Input). 
STB 
is 
an 
input 
control 


pulse 
used 
to strobe 
data 
at the data 
input 


pins 
(Ao-A,) 
into 
the 
data 
latches. 
This 


signal 
is active 
HIGH 
to admit 
input 
data. 


The 
data 
is 
latched 
at 
the 
HIGH 
to 
LOW 


transition 
of STB. 


OUTPUT 
ENABLE 
(Input). 
DE is 
an 
input 


control 
signal 
which 
when 
active 
LOW 


enables 
the 
contents 
of 
the 
data 
latches 


onto 
the 
data 
output 
pin (Bo-B,). 
OE being 


inactive 
HIGH 
forces 
the 
output 
buffers 
to 


their 
high 
impedance 
state. 


DATA 
INPUT 
PINS 
(Input). 
Data 
presented 


at 
these 
pins 
satisfying 
setup 
time 
reo 


quirements 
when 
STB 
is 
strobed 
and 


latched 
into 
the 
data 
input 
latches. 


000-00, 
(8282) 
500-50, 
(8283) 


DATA 
OUTPUT 
PINS 
(0UIPutl-"WIi 


true, 
the 
data 
in 
the 
data 
iatch~S 
~ 


sented 
as 
inverted 
(8283) 
or 
non-Inve 


(8282) data 
onto 
the 
data 
output 
pins. 


The 
8282 
and 
8283 
octal 
latches 
are 8·bit 
latches 
with 


3·state 
output 
buffers. 
Data 
having 
satisfied 
the 
setup 


time 
requirements 
is 
latched 
into 
the 
data 
latches 
by 


strobing 
the 
STB 
line 
HIGH 
to 
LOW. 
Holding 
the 
STB 


line 
in its 
active 
HIGH 
state 
makes 
the 
latches 
appear 


transparent. 
Data is presented 
to the data output 
pins 
by 


activating 
the 
OE input 
line. 
When 
DE is inactive 
HIGH 
the 
output 
buffers 
are 
in 
their 
high 
impedance 
state. 


Enabling 
or disabling 
the 
output 
buffers 
will 
not 
cause 


negative-going 
transients 
to appear 
on 
the 
data 
output 


bus. 


D.C. AND 
OPERATING 
CHARACTERISTICS 


ABSOLUTE 
MAXIMUM 
RATINGS· 


Temperature 
Under 
Bias. 
. O·C to 70·C 


Storage 
Temperature 
-6S·C 
to 
+ 1S0·C 


All 
Output 
and Supply 
Voltages 
- O.SV to + 7V 


All 
Input 
Voltages 
- 
1.0V to + S.SV 


Power 
Dissipation 
1 Watt 


D.C. CHARACTERISTICS 
FOR 8282/8283 


Conditions: 
Vcc; 
SV ± S%, T.; 
O·C to 70·C 


'COMMENT: 
Stresses 
above 
those 
listed 
under 
"Absolute 
Maximum 


Aatings" 
may 
cause 
permanent 
damage 
to the device. 
This 
is a slress 


rating 
only 
and functional 
operation 
of the device 
at these 
or any olher 


conditions 
above those 
indicated 
in the operalional 
seclions 
of this 


specification 
1$ not implied. 
Exposure 
to absolute 
maximum 
rating 
con- 


ditions 
for extended 
periods 
may alleci 
device reliability. 


Symbol 
Parlmeter 
Mln 
Max 
Units 
Test 
Conditions 


Ve 
Input 
Clamp 
Voltage 
-1 
V 
Ie; 
-S 
mA 


Ice 
Power 
Supply 
Current 
160 
mA 


IF 
Forward 
Input 
Current 
-0.2 
mA 
VF ; 
0.4SV 


IA 
Reverse 
Input 
Current 
SO 
~A 
VA ; 
S.2SV 


VOL 
Output 
Low 
Voltage 
O.SO 
V 
10L ; 
32 mA 


VOH 
Output 
High 
Voltage 
2.4 
V 
10H; 
-S 
mA 


10FF 
Output 
Off 
Current 
± SO 
~A 
VOFF ; 
0.4S to S.2SV 


V,L 
Input 
Low 
Voltage 
0.8 
V 
Vee; 
S.OV 
See Note 
1 


V,H 
Input 
High 
Voltage 
2.0 
V 
Vee; 
S.OV 
See Note 
1 


F; 
1 MHz 
C,N 
Input 
Capacitance 
12 
pF 
VB1'S;2.SV, 
Vee; 
SV 


T.;2S·C 


D., 
Vee 


D., 
DO, 


D., 
DO, 


D., 
DO, 


DI, 
DO, 


D., 
DO, 


D', 
DO, 


D', 
DO, 


DE 
DD, 


GND 
ST8 


D., 
Vee 


DI, 
500 


D., 
00; 


D., 
DOi 


DI, 
00, 


D., 
00. 


D., 
00, 


D., 
00. 
or 
00, 


GND 
STI 


DATA 
IN 


DATA OUT 


OUTPUT EHAILE 


STAOIE 


A.C. CHARACTERISTICS 
FOR 828218283 


Conditions: 
vcc = 5V± 5%, T. = O·C to 70·C 


Loading: 
Outputs 
- 
10L = 32 mA, IOH = - 5 mA, CL = 300 pF 


Symbol 
Parameter 
Min 
Ma. 
Units 
Test Conditions 


TIVOV 
Input 
to Output 
Delay 
(See Note 
1) 


-Inverting 
25 
ns 
-Non·lnverting 
35 
ns 


TSHOV 
STS to Output 
Delay 


-Inverting 
45 
ns 


-Non-Inverting 
55 
ns 


TEHOZ 
Output 
Disable 
Time 
25 
ns 


TELOV 
Output 
Enable 
Time 
10 
50 
ns 


T1VSL 
Input 
to STS Setup 
Time 
0 
ns 


TSLIX 
Input 
to STS Hold 
Time 
25 
ns 


TSHSL 
STS High 
Time 
15 
ns 


INPUT'4 ~ 


••- 
:=,~,,",j 


TSHSl~-' 
-----------------------_ 


TELOV 
L- 


_.::!.o~-~~_~ 


I.SV 
'" 


2.14V 


-1- 
~- 
~"" 


OUT 
OUT 
OUT 
r 


30 
O" 
r 300" 
r~" 


3·STATE 
TO 
VOL 
3-STATE 
TO 
VOH 
SWITCHING 


D.C. AND OPERATING CHARACTERISTICS 


ABSOLUTE MAXIMUM RATINGS· 


Temperature 
Under 
Bias. 
. 
O·Ct070·C 


Storage 
Temperature. 
.. -65'C 
to + 150'C 


All Output 
and Supply 
Voltages. 
. 
- O.5V to + 7V 


All 
Input 
Voltages. 
. 
- 1.0V to + 5.5V 


Power 
Dissipation 
1 Watt 


CYSNC 


PClK 


AEN1 


ROYl 


READY 


RDY2 
Jm<2 


CLK 


GND 


X2 


TNK 


EFI 


Fie 


OSC 


RES 


RESET 


XI 


XTAL 
x, 
OSCIL· 


LATOR 


rANI( 


,il! 


EFI 


'COMMENT: 
Stresses 
above 
thOse 
listed 
under 
"Absolute 
Maximum 


Ratings" 
may 
cause 
permanent 
damage 
10 the device. 
This 
is a slress 


rating 
only 
and functional 
operation 
of the device 
at lhese 
or any other 
conditions 
above those 
indicated 
in the operational 
sections 
of this 


specificatioQ 
is nol 
implied. 
Exposure 
to absolute 
maximum 
rating 
con- 


ditions 
lor extended 
periods 
may alleet 
device 
reliability. 


XI 
I 


X2' 


TANK 


FIC 


EFI 


CSYNC 


ROY1 
I 


RDY2 
I 


AEN11 
AOOI 


RES 


RESET 


OSC 


CLK 


PCLK 


READY 


VCC 
GND 


CONNECTIONS 
FOR 
CRYSTAL 


USED 
WITH 
OVERTONE 
CRYSTAL 


CLOCK 
SOURCE 
SELECT 


EXTERNAL 
CLOCK 
INPUT 


CLOCK 
SYNCHRONIZATION 
INPUT 


READY 
SIGNAL 
FROM 
TWO 
MUL TIBUS 


TfIll 
SYSTEMS 


ADDRESS 
ENABLED 
QUALIFIERS 
FOR 
RDY1.2 


RESET 
INPUT 


SYNCHRONIZED 
RESET 
OUTPUT 


OSCILLATOR 
OUTPUT 
MOS 
CLOCK 
FOR 
THE PROCESSOR 


TTL 
CLOCK 
FOR 
PERIPHERALS 


SYNCHRONIZED 
READY 
OUTPUT 


+5 
VOLTS 


o VOLTS 


D.C. CHARACTERISTICS 
FOR 8284 


Conditions: 
T. = O'C 
to 70'C; 
Vee = 5V:t 
10% 


Symbol 
Parama.ar 


A.C. CHARACTERISTICS 
FOR 8284 


Conditions: 
T. = O'C 
to 
70'C; 
Vcc = 5V ± 10% 


TIMING 
REQUIREMENTS 


Symbol 
P.ramet.r 


TEHEl 
Elilernal 
Frequency 
High 
Time 


TELEH 
Elllernal 
Frequency 
Low Time 


TElEL 
EFI Period 


Xl AL FreQuency 


TA1VGl 
ROY1, ACY2 
Set-Up 
10 CL!<- 


TClR1X 
ROY1. ACY2 
Hold 
10 elK 


TA1VR1V 
AEN1, A!N!Set·Up 10 AOY1, ACY2 


TCLA1X 
AEN1, 
AEN2 
Hold 
10 elK 


TYHEH 
CSYNC 
Set-Up 
10 EFI 


TEHYl 
CSVNC 
Hold 
10 EFI 


TYHYL 
CSYNC 
Width 


TllHCl 
RES Set·Up to elK 


TCU1H 
RES Hold 
10 elK 


Forward 
Input 
Current 


Reverse 
Input 
Current 


Input 
Forward 
Clamp 
Voltage 


Power 
Supply 
Current 


Input 
lOW 
Voltage 


Input 
HIGH 
Voltage 


Reset 
Input 
HIGH 
Voltage 


Output 
LOW 
Voltage 


Output 
HIGH 
Voltage 
ClK 


Other 
Outputs 


RES Input 
Hysteresis 


Symbol 


TCLCl 


TCHCL 


TeLCH 


TCH1CH2 
TCl2Cll 


TPHPL 


TPlPH 


JAYLel 


TAYHCH 


TCUl 


elK 
Cycle 
Period 


elK 
HIgh TIme 


elK 
low 
Time 


pelK 
HIgh Time 


pelt<. 
Low 
Time 


Ready 
InaclIve 
to eLK (See ~~ 


Ready ActIve to elK 
(See Nole 3) 


eLK 
10 Reset 
Delay 


,.",.,.,,~,,;~,~ 


'-'rt'ltC'J 
~~ 


Mln 
Max 
Unltl 
Tel' 
Condition 
I 
'. ,...• 


-0,5 
mA 
VF = 0.45V 
'/, 
;. 


50 
y.A 
VR= 
5,25V 


-1,0 
V 
le= 
- 5 mA 


140 
mA 


0.8 
V 
Vee= 
5,OV 


2,0 
V 
Vee= 
5,OV 


2,6 
V 
Vee= 
5,OV 


0.45 
V 
5 mA 


4 
V 
-1 
mA 


2.4 
V 
-1 
mA 


0,25 
V 
Vee= 
5,OV 


Mln 


13 


13 


TEHEL 
.•.TELEH .•. d 


'2 


35 
o 


'5 


Units 
T•• t Condition. 


90% 
- 90% 
VIN 


10% - 
10% VIN 


(Nole 
1) 


MH' 


n, 


{Note 
2j 


(Note 
2) 


Unlls 
Test Conditions 


FIg 3 &. Fig 4 


Fig 3 &. Fig 4 


1 OV to 3 5V 


Fig 
5 & Fig 6 


Fig 5 &. Fig 6 


20 


20 


2 TELEL 


65 


20 


-- 


(V:JTClCL) 
.•. 20 


_~~,TCL~~~- 
1_~.~ 


TClCL- 
20 


TClCL 
- 20 
-s 


-- 


- - 
~--- 
- 
-- 


;::.( 
~J\Jl .. 


osc oJ\Jl.. 
:~::: 


_TCLlI"_I_Tfl"CL_ 


PIN DEFINITIONS 
Pin 
Description 


TRANSMIT 
(Input). 
T 
is 
an 
input 
control 


signal 
used 
to control 
the direction 
of the 
transceivers. 
When 
HIGH, 
it configures 
the 


transceiver's 
80-87 
as outputs 
with Ao-A] 
as inputs. 
T LOW configures 
Ao-A7 
as the 


outputs 
with 
Bo-B7 
serving 
as the 
inputs. 


OUTPUT 
ENABLE 
(Input). 
OE 
is an 
input 


control 
signal 
used to enable 
the appropri· 


ate output 
driver 
(as selected 
by T) onto 
its 
respective 
bus. This signal 
is active 
LOW. 


LOCAL 
BUS 
DATA 
PINS 
(Input/Output). 


These 
pins 
serve to either 
present 
data 
to 
or accept 
data 
from 
the 
processor's 
local 
bus depending 
upon 
the state 
of the T pin. 


D.C. AND OPERATING 
CHARACTERISTICS 


ABSOLUTE 
MAXIMUM 
RATINGS· 


Temperature 
Under Bias 
O·C to 70·C 


Storage 
Temperature. 
. 
-65·C 
to + 150·C 


All Output 
and Supply 
Voltages. 
. - 0.5V to + 7V 


All Input 
Voltages. 
- 1.0V to + 5.5V 
Power 
Dissipation. 
.1 Watt 


Bo-B7 
(8286) 
B,;-"B, 


(8287) 


U<J,.. 
.""" 


SYSTEM 
BUS 
DATA 
PINS 
(Inp 


These 
pins 
serve to either 
p'resehl 
de 


or 
accept 
data 
from 
the 
system 
bus 
d 


pending 
upon the state 
of the T pin. 


The 8286 
and 
8287 
transceivers 
are 8·bit 
transceivers 


with 
high 
impedance 
outputs. 
With 
T active 
HIGH 
and 
OE active 
LOW, data at the Ao-A7 
pins is driven 
onto the 


Bo-B7 
pins. 
With 
T inactive 
LOW and DE active 
LOW, 


data at the Bo-B7 
pins 
is driven 
onto 
the Ao-A7 
pins. 
No 


output 
low 
glitching 
will 
occur 
whenever 
the 
trans· 
ceivers 
are 
entering 
or 
leaving 
the 
high 
impedance 


state. 


·COMMENT: 
Stresses 
above 
those 
listed 
under 
"Absolute 
Maximum 


Ratings" 
may cause 
permanent 
damage 
10 the device 
ThIS IS a stress 


rating only and functional 
operation 
of the device 
at these or any other 


conditions 
above 
those 
indicated 
in lhe 
operatIonal 
sections 
01 thiS 


specification 
is not implied. 
El(posure 
tQ absolule 
maximum 
rating con· 


ditions 
lor extended 
periods 
may affect 
device 
reliability. 


Symbol 
Parameter 
Min 
Max 
Units 
Test Conditions 


Vc 
Input 
Clamp 
Voltage 
-1 
V 
le= 
-5 mA 


Ice 
Power Supply 
Current-8287 
130 
mA 


-8286 
160 
mA 


IF 
Forward 
Input Current 
-0.2 
mA 
VF = 0.45V 


IA 
Reverse 
Input Current 
50 
"A 
VA = 5.25V 


VOL 
Output 
Low Voltage 
-BOutputs 
0.5 
V 
IOL=32 
mA 
-A 
Outputs 
0.5 
V 
10L= 10 mA 


VOH 
Outpul 
High 
Voltage 
-B 
Outputs 
2.4 
V 
10H= -5 mA 


-A 
Outputs 
2.4 
V 
10H= -1 mA 


IOFF 
Output 
Off Current 
IF 
VOFF= 0.45V 


IOFF 
Output 
Off Current 
IA 
VOFF= 5.25V 


V'L 
Input 
Low Voltage 
-A 
Side 
08 
V 
Vcc = 5.0V, See Note 1 
...,.B Side 
0.9 
V 
Vee= 
5.0V. See Note 1 


V,H 
Input 
High 
Voltage 
20 
V 
Vec = 5.0V. See Note 
1 


F = 1 MHz 


C'N 
Input 
Capacitance 
12 
pF 
VB1AS=2.5V. 
Vcc=5V 


TA= 
25"C 
- 


CYSNC 
VCC 


PCLK 
X, 


AEN1 
X2 


RDYl 
TNK 


READY 
EFI 


RDY2 
Fie 


AEN2 
OSC 


CLK 
RES 


GNO 
RESET 


ROYl 
~ 
AENl ----~po____i__I 


UN2 
t>"===,, 
ROY2 
-- 
---L.J 


XII 
X2' 
TANK 


FIC 


EFI 


CSYNC 


RDYl 
I 


RDY2 
I 


AEN11 
AEN21 


RES 


RESET 


OSC 


CLK 


PCLK 


READY 


VCC 
DND 


CONNECTIONS 
FOR 
CRYSTAL 


USED 
WITH 
OVERTONE 
CRYSTAL 


CLOCK 
SOURCE 
SELECT 


EXTERNAL 
CLOCK 
INPUT 


CLOCK 
SYNCHRONIZATION 
INPUT 


READY 
SIGNAL 
FROM 
TWO 
MULTIBUSr 
••• SYSTEMS 


ADDRESS 
ENABLED 
QUALIFIERS 
FOR 
RDY1.2 


RESET 
INPUT 


SYNCHRONIZED 
RESET 
OUTPUT 


OSCILLATOR 
OUTPUT 


MOS 
CLOCK 
FOR 
THE PROCESSOR 


TTL CLOCK 
FOR 
PERIPHERALS 


SYNCHRONIZED 
READY 
OUTPUT 


+ 5 VOLTS 
o VOLTS 


A.C. CHARACTERISTICS 
FOR 8286/8287 


Conditions: 
Vcc ; 
5V ± 5%, 
TA 
; 
O·C to 70·C 


loading: 
B Outputs 
- 
10L 
; 
32 mA, 
10H 
; 
- 5 mA, CL 
; 
300 pF 


A Outputs 
- 
10L 
; 
10 mA, 
10H; 
- 1 mA, CL = 100 pF 


Symbol 
Parameter 
Mln 
Ma. 
Units 
Test 
Conditions 


T1VOV 
Input 
to Output 
Delay 
Inverting 
25 
ns 
(See Note 1) 


Non-Inverting 
35 
ns 


TEHTV 
Transmit/Receive 
Hold Time 
TEHOZ 
ns 


TTVEL 
Transmit/Receive 
Setup 
30 
ns 


TEHOZ 
Output 
Disable 
Time 
25 
ns 


TELOV 
Output 
Enable 
Time 
10 
50 
ns 


1.'UT5 __ 
t 
_ 


6E 
1 


j-nvov-l 
.. 
TEHOZ 
.• 
~- 


OUTPUTS 
~¥ 
~I__ y~~~-- 
1_ 
TEHTV 
- 
-_I 
______ 
~x 
_ 


TEST LOAD 
CIRCUITS 


1,5V 
t,SV 
2.14V 
~- 
~- 
~.,o 


OUT 
OUT 
OUT 
r 
300 
., 
roo" 
roo" 


3·srATe 
TO VOL 
3·STATE 
TO 
VOL 
SWITCHING 


B OUTPUT 
A OUTPUT 
B OUTPUT 


1.SV 
1.SV 
2,28V 
~- 
~- 
~ ..• 


OUT 
OUT 
OUT 
r 300.' 
roo" 
roo" 


)-STATE 
TO 
VOH 
)·STATE 
TO 
VOH 
SWITCHING 


B OUTPUT 
A OUTPUT 
A OUTPUT 


D.C. AND 
OPERATING 
CHARACTERISTICS 


ABSOLUTE 
MAXIMUM 
RATINGS· 


Temperature 
Under 
Bias 
Storage 
Temperature 
All Output 
and Supply Voltages. 


All Input VOltages 
Power 
Dissipation 


.. O·C 
10 70·C 
- 6S·C 
10 + lSO·C 


. - O.SV to + 7V 


- 
1 OV to + 5.5V 
1.5 Wall 


'COMMENT: 
Stresses 
above 
those 
listed 
under 
"Absolute 
MaximUfTl 


Ratings" 
may cause 
permanent 
damage 
10 the device. 
ThIs 
tS a sires, 


rating 
only and functional 
operation 
01 the device 
at these 
or any other 


conditions 
above those 
indicated 
in the operational 
sections 
01 this 


specification 
is not implied. 
Ellposure 
10 absolute 
mallimum 
rating con· 


ditions 
lor extended 
periods 
may allect 
device 
reliability 


Symbol 


Ve 
'ee 
'F 
'A 
Val 


Parameter 


Input Clamp Voltage 


Power Supply 
Current 


Forward 
Input 
Current 


Reverse Input Current 


Output 
Low voltage-Command 
OutpulS 


Control 
Outputs 


Output 
High VOllage- 
Command 
Outputs 


Control 
Outputs 


Max 
Unit 
Test Conditions 
-, 
IC- 
-5mA 


230 
mA 


-0,7 
mA 
VF = 0 45V 


50 
.A 
VR- 
VCC 


0.5 
V 
IOL-32 
mA 
0.5 
V 
tOL= 
16 mA 


V 
10H- 
-5 
mA 


V 
10H= 
-1 
mA 


08 


'00 
,.A 
VOFF - 0.4 to 5.25V 


A.C. CHARACTERISTICS 
FOR THE 8288 
Conditions: 
Vee= 5V ± 10%, 
TA = O·C 
10 70·C 


TIMING 
REOUIREMENTS 


InpUI Low Vottage 


Input High Voltage 


Output 
Off Current 


Symbol 
Parameter 
Min 
Max 
Unil 
Loading 


TCLCL 
CLK Cycle 
Period 
125 


TCLCH 
CLK Low Time 
-- 


66 


TCHCL 
CLK High Time 
'0 


TSVCH 
Status 
ActIve 
Setup Time 
65 
-- 


TCHSV 
Status 
Active 
Hold Time 
'0 


TSHCL 
Status Inactive 
Setup Time 
55 
-- 


TCLSH 
Status 
Inactive 
Hold Time 
10 
---- 


TIMING 
RESPONSES 


Symbol 


TCVNV 


TCVNX 


TCLLH, TCLMCH 


TSVLH, TSVMCH 


TCHLL 


TCLML 


TClMH 


TCHOTl 


TCHOTH 


TAElCH 


TAEHCZ 


TAElCV 


TAEVNV 


TCEVNV 


TCElRH 


Parameter 


Conlrol 
Active 
Delay 


Control 
Inactive 
Detay 


ALE MCE Active 
Delay (from CLK) 


ALE MCE Aclive 
Delay (from StaluS) 


ALE Inactive 
Delay 


Command 
Active 
Delay 


Command 
Inactive 
Delay 


Direction 
Control 
Acltve 
Delay 


Direction 
Control 
Inactive 
Delay 


Command 
Enable Time 


Command 
Disable 
Time 


Enable Delay Time 


AEN to DEN 


CEN 10 DEN, PDEN 


CEN to Command 


lOB [ '7"'-"'; 
] 
Vee 


elK 
[ 
2 
19 ] so 
51 [ 
3 
18 ] 52 


DT/A [ 
4 
17 :J MCEJPDEN 


ALE 
[ 
5 
16 :J DEN 


AEN 
[ 
6 
8268 
15 
] 
CEN 


MRDe 
[ 
7 
14 
] 
INTA 


AMWC 
[ 
8 
13 
] 
IORC 


MWTC 
[ 
9 
12 
] 
AIOWC 


GND[ 
10 
11 ]IOWC 


Max 
'5 
'5 


15 


15 
'5 


35 


35 


50 


30 
'0 
'0 


275 


20 


2() 


TClMl 


MADe I 
10RC 
MWTe 
IOWC 
INTA 
AMWC 
AIOWC 


tOl =~2 mA 
IOH= -5 
mA 


Cl=300 
pF 


1.5V 


."'~'. 


l'O<l" 


3·STATE 
TO 
LOW 


-~"" 


1 
300 " 


3·STATE 
TO 
HIGH 


2.14V 


-~"" 


1 
300 " 


COMMAND 
OUTPUT 


TEST 
LOAD 
·"'~t"" 
lOO" 


CONTROL 
OUTPUT 
TEST 
LOAD 


-;"~f..'II,; 
,.'~ 


'Ie 
I:; 
~ 
1,,,,"0/ 
~ 
II~ 
./. 


COMMENT: 
Stresses 
abOve 
those 
listed 
under "A'alq.tOfe,Ma 
RatIngs·' 
may cause 
permanent 
damage 
10 the device ~,(icw"y'-e 


rating 
onty and funcllOnal 
operallon 
olthe 
device at these ofJ~y,FB'fbttl 


conditIons 
above 
those 
indicated 
In the operational 
secltons 
of~S" 


speCification 
1$not Implied. 
Exposure 
10 absolute 
maximum 
rallng coN. 


dltlons 
for extended 
periOds may alleel 
device 
relIability 


Temperature 
Under 
Bias 
O'C 
to 70'C 


Storage 
Temperature.. 
. .. - 65'C 
to 
+ 150'C 
All 
Output 
and 
Supply 
Voltages. 
. .. - 0.5V to + 7V 
All 
Input 
Voltages 
- 
1.0V to + 5.5V 


Power 
Dissipation. 
. 
1.5 Watt 


D.C. CHARACTERISTICS 
FOR THE 8289 


CONDITIONS: 
TA=O' 
to 70'C, 
Vee=5V 
± 10% 


Symbol 
Parameter 
Min. 
Max. 
Units 
Tast 
Condition 


Ve 
Input 
Clamp 
Voltage 
-1.0 
V 
Vee = 4.50V, 
Ie = - 5 mA 


IF 
Input 
Forward 
Current 
-0.5 
mA 
Vee = 5.50V, 
VF = 0.45V 


iR 
Reverse 
Input 
Leakage 
Current 
60 
~A 
Vee = 5.50, VR = 5.50 


VOL 
Output 
Low 
Voltage 
BUSY,CBRO 
0.45 
V 
IOl = 20 mA, 
Cl = 250 pF 1) 
AEN 
0.45 
V 
IOl = 16 mA, 
Cl = 100 pF 2) 


BPRO,BREO 
0.45 
V 
IOl= 
10 mA, 
Cl=60 
pF 3) 


VOH 
Output 
High 
Voltage 


BUSY', CBRO 
Open 
Collector 


All 
Other 
Outputs 
2.4 
V 
IOH = 400 ~A 


Ice 
Power 
Supply 
Current 
165 
mA 


V'l 
Input 
Low 
Voltage 
.8 
V 


V,H 
Input 
High 
Voltage 
2.0 
V 


Cin 
Status 
Input 
Capacitance 
25 
pF 


Cin (Others) 
Input 
Capacitance 
12 
pF 


1'00 


0 
' 
I~O' 


52 
vee 


iOB 


SYSB/RESB 
So 


RESS 
elK 


BCD< 
~ 


INIT 
CAQ[Cj( 


8iiRl 
ANYRQST 


BPRO 
Aal 


SPRN 
~ 


GNP 
BUSY 


CONDITIONS: 
Vcc=5V 
± 10%, 
TA=O'C 
to 70'C 


Timing 
Requirements 


Symbol 
Parameter 
Min. 
Max. 
Unit 


TCLCL 
CLK 
Cycle 
Period 
125 
ns 


TCLCH 
CLK 
Low 
Time 
65 
ns 


TCHCL 
CLK 
High 
Time 
35 
ns 


TSVCH 
Status 
Active 
Setup 
65 
TCLCL·10 
ns 


TSHCL 
Status 
Inactive 
Setup 
50 
TCLCL·10 
ns 


THVCH 
Status 
Active 
Hold 
10 
ns 


THVCL 
Status 
Inactive 
Hold 
10 
ns 


TBYSBL 
BUSYtlSetup 
to BCLKI 
20 
ns 


TCBSBL 
CBROIISetup 
to BCLKI 
20 
ns 


TBLBL 
BCLK 
Cycle 
Time 
100 
ns 


TBHCL 
BCLK 
High 
Time 
30 
.65[TBLBLJ 
ns 


TCLLL 1 
LOCK 
Inactive 
Hold 
20 
ns 


TCLLL2 
LOCK 
Active 
Setup 
40 
ns 


TPNBL 
BPRNllto 
BCLK 
Setup 
Time 
15 
ns 


TCLSR1 
SYSB/1ltSli 
Setup 
0 
ns 


TCLSR2 
SYSB/RESS 
Hold 
20 
ns 


TIVIH 
Initialization 
Pulse 
Width 
3 TBLBL+ 
ns 
3 TCLCL 


Symbol 
Parameter 
Min. 
Max. 
Unit 
Loading 


TBLBRL 
BCLK 
to BREO 
Delayll 
35 
ns 


TBLPOH 
BCLK 
to BPROll 
(See Note 
1) 
40 
ns 


TPNPO 
BPRNllto 
BPROIIDelay 
25 
ns 


(See Note 
1) 


TBLBYL 
BCLK 
to BUSY 
Low 
60 
ns 


TBLBYH 
BCLK 
to BUSY 
Float 
(See 
Note 
2) 
35 
ns 


TCLAEH 
CLK 
to AEN 
High 
65 
ns 


TBLAEL 
BCLK 
to AEN 
Low 
40 
ns 


TBLCBL 
BCLK 
to CBRQ 
Low 
60 
ns 


TBLCBH 
BCLK 
to CBRO 
Float 
(See 
Note 
2) 
35 
ns 


a Denotes 
thai 
spec 
applies 
10 both 
transitions 
01 the signaL 


NOTE 1: BCLK 
generates 
the first 
BPRO 
wherein 
subsequent 
BPAO 
changes 
lower 
in the chain 
are generated 
through 
SPRN. 


NOTE 
2: Measured 
at ,5V above 
GND. 


~ 
f 


OPERATION 


rmT VccATOV.10%-. 
_ 


TIVIH------. 


"""" __ 
\ __ 
"_"OHb=-' 
~_ 
....., 


,mrn." 
"HOC 
• 
• 
1 


,=:;, 
\: - ,- 4--------------- 


NOTES 
1. LOCK ACTive 
CAN OCCUR DURING ANY T STATE. AS tONG 
AS THE RELATIONSHIPS 
SHOWN ABOVE WITH RESPECT TO THE eLK 
ARE MAINTAINEO. 
LOCK INACTIVE 
HAS 
NO CRITICAL 
TIME 
AND CAN 
BE ASYNCHRONOUS. 


-~Cii. 
HAS NO CRITICAL TIMING AND 15 CON510EAEO 
AN ASYNCHRONOUS 
INPUT 
SIGNAL 


2 
GliTCH1NG 
OF 
SYSBIAtSa 
PIN 1$ PERMITTED 
DURING 
THIS 
TIME. 
AFTER 
oJ 2 Of 
Tl. 
AND 
BEFORE 
01 OF U. 
ONLY 
CtEAN 
TRANSITIONS 
ARE ACCEPTEO 
3 
AU! 
LEAOING 
EDGE 
15 RELATED 
TO 8m. 
TRAiliNG 
EDGE 
TO elK. 
THE 
TRAilING 


EOGE 
Of 
AEN 
OCCURS 
AFTER 
BUS 
PRIORITY 
IS lOST 


The signals 
related 
10 eLK 
are typical 
processor 
signalS. 
and do not relate 
to the depicted 
sequence 
of events 
at the 
Signals 
referenced 
10 BeLK. 
The signals 
shown 
related 
to the BCLK 
represent 
a hypothetical 
sequence 
of events 
tor 


illustration. 
Assume 
3 bus arbiters 
of priorities 
1. 2 and 3 configured 
in serial 
priority 
resolving 
scheme 
as shown 
in 
Figure 
6. Assume 
arbiter 
1 has the bus and is holding 
busy 
low. 
Arbiter 
#2 detects 
its processor 
wants 
the bus and 
pulls 
low BAEO#2, 
If BPAN#2 
is high (as shown). 
arbiter 
#2 will 
pull 
low CBRO 
line. CBAO 
signalS 
to the higher 
priOrity 
arbiter 
#1 that 
a lower 
priority 
arbiter 
wants 
the bus. {A higher 
priority 
arbiter 
would 
be granted 
BPAN 
when 
it makes 
the bus request 
rather 
than having 
to wait 
for another 
arbiter 
to release 
the bus through 
C'B11Ol .• ' Arbiter 
#1 will 
relin· 
quish 
the multi·master 
system 
bus when 
it enters 
a state 
not requiring 
it (see Table 
1). by lowering 
its BPAO#1 
(tied 
to 


BPRN#2) 
and releasing 
BUSY. Arbiter 
#2 now sees that 
it has priority 
from 
BPAN#2 
being 
low and releases 
CBRQ, 
As 
soon 
as BUSY signifies 
the bus is available 
(high), 
arbiter 
#2 pullS BUSY low on next 
falling 
edge of BCLK. 
Note 
that 
if 
arbiter 
#2 didn't 
want 
the bus at the lime 
it received 
priority, 
it would 
pass 
priority 
to the next 
lower 
priority 
arbiter 
by 
lowering 
its 
BPAO 
#2 [TPNPO\. 


The 8088 CPU 


The 8088 is an 8086 microprocessor 
with an 8-bit data bus. The two parts are 
otherwise 
identical. Therefore 
we will describe differences 
between the 8088 and the 
8086 in the text which follows. 


8088 
PROGRAMMABLE 
REGISTERS AND 


ADDRESSING 
MODES 


8088 programmable 
registers and addressing 
modes are identical to the 8086 in 
every way. 


8088 CPU pins and signals are illustrated in Figure 0-1. As compared to the 8086 
pins and signals illustrated in Figure 10-1, only pin 34 differs (with the exception of pins 
2-8 and 39 being address only). 
For the 8086, pin 34 outputs BHE. This signal discriminates 
between the high- 
order byte and the low-order byte on the 16-bit 8086 data bus. Since the 8088 has an 8- 
bit data bus, BHE and associated logic is irrelevant. The 8088 outputs maximum 
mode 
SO status at pin 34 (SSO). 
The 101M signal has opposite polarity for the 8088, as compared to the 8086. This 
makes the 8088 compatible with the 8085. 


GND 
A14 
A13 
A12 
All 
Al0 


A9 
A8 
AD7 
AD6 
AD5 
AD4 
AD3 
AD2 
ADI 
ADO 
NMI 
INTR 
CLK 
GND 


1 
40 


2 
39 


3 
38 
4 
37 
5 
36 
6 
35 
- 
7 
34 
~ 
8 
33 
9 
~2 
10 
8088 
31 - 
11 
CPU 
30 
- 
- 
12 
29 
- 
13 
28 
- 
14 
27 
15 
26 
16 
25 
- 
17 
24 
- 
18 
23 
19 
22 - 
20 
21 - 


VCC 
A15 
A16/S3 
A17/S4 
A18/S5 
A19/S6 


SSO 
MN/MX 
RD 
RO/GTO, HOLD 
RO/GT1, HLDA 
LOCK, WR 
S2,10/M 
Sl,DT/R 
SO, DEN 
OSO, ALE 
OSl, 
INTA 
TEST 
READY 
RESET 


Pin Name 
Description 
Type 


ADO-AD7 
Address/Data 
8us 
8idirectional. 
tristate 
A8-A15 
Address Bus 
Output. tristate 


A 16/S3, A 17/S4 
Address/Segment 
identifier 
Output, trisate 


A18/S5 
Address/Interrupt 
enable status 
Output. tristate 
A19/S6 
Address/status 
Output. tristal~ 
SSO 
Status output 
Output. tristate 


RD 
Read control 
Output. tristate 
READY 
Wait state request 
Input 


TEST 
Wait for test control 
Input 


INTR 
Interrupt request 
Input 


NMI 
Non-maskable interrupt request 
Input 
RESET 
System Reset 
Input 


CLK 
-<, 
System Clock 
Input 
MN/MX 
= GND for a maximum system 
SO. 51, S2 
Machine cycle status 
Output, tristate 
RO/GTO, RO/GTl 
Local bus priority control 
Bidirectional 
OSO,OSl 
Instruction queue status 
Output 


LOCK 
Bus hold control 
Output. tristate 


MN/MX 
= VCC for a minimum system 


10/M 
Memory or I/O access 
Output, tristate 
WR 
Write control 
Output, tristate 


ALE 
Address Latch enable 
Output 


DT/R 
'< 
Data transmit/receive 
Output, tristate 


DEN 
Data enable 
Output. tristate 
INTA 
Interrupt acknowledge 
Output. tristate 
HOLD 
Hold request 
Input 
HLDA 
Hold acknowledge 
Output 


VCC.GND 
Power, ground 
DMaximum System Signals 
DMinimum System Signal; 


101M 
DT/R sso 


0 
0 
0 
Code segment access 
0 
0 
1 
Memory read 
0 
1 
0 
Memory write 
0 
1 
1 
No operations 
1 
0 
0 
Interrupt acknowledge 
1 
0 
1 
110 read 
1 
1 
0 
1/0 write 
1 
1 
1 
Halt 


Since the 8088 has no BHE signals, nor need for any such signal, the discussion of 
external memory addressing and BHE given for the 8086 will not apply to the 8088. 


The 8088 has a 4-byte instruction 
object code queue; the 8086, in contrast, has a 
6-byte instruction 
object code queue. The 8088 will start executing instruction 
fetch bus 
cycles to fill its 4 byte queue as soon as one or more queue bytes are empty. The 8086, in 
contrast, will not start pre-fetching instruction 
object code bytes until two or more of its 
6 queue bytes are empty. The description 
of bus cycles and queue logic given for the 
8086 otherwise applies directly to the 8088. 


Bus cycle timing for the 8088 and the 8086 differ only at the multiplexed 
datal 
address bus cycles. Timing differences are confined to the eight address bus lines A8- 
A15, and may be illustrated as follows: 


8086 
ADO-AD15 
Data Out 
$ 
8088 
ADO-AD7 
~g 
8088 
A8-A15 
"tl 
ls 
co 
CD 
~ 
a: 
g 
0 


8086/8088 
E 
CD 
A16-A19 
Status 
ls 
::! 
~ 
0 
E 
8086 
ADO-AD15 
CD 


Data In 
::! 
8088 
ADO-AD7 


Apart from the fact that the 8088 has no BHE signal, all timing for signals other 
than the data/address 
bus is identical for the 8086 and the 8088. 


When operating in minimum 
mode, the 8088 delays the ALE pulse by one clock 
period as compared to 8086 timing. This may be illustrated as follows: 


8086 and 8088 logic is absolutely identical for the following states and logic: 


1. 
The Wait state 
2. 
The Hold state 


3.. 
RQ/GT logic 
4. 
Lock logic 
5. 
Wait for test state 


6. 
Processor escape 


7. 
Device reset 
8. 
Interrupt 
processing 


9. 
Single stepping mode 


The 8086 and 8088 instruction 
sets, listed in numerous 
tables in this book, are identical 
with the exception of execution 
times. Since the 8088 has an 8-bit bus, two bus cycles 
will have to be executed wherever the 8086 would have executed a single bus cycle to 
fetch 16 bits of data. Appendix A provides execution 
times for the 8086. 


AAA, 3-49-50, 
4-19 
flags, 3-27 
AACK. See Multibus 
AAD, 3-51- 
52, 4-30 
flags, 3-29 
AAM, 3-53-54, 
4-26 
flags, 3-29 
AAS, 3-55 - 56, 4-23 
flags, 3-27 
ADC 
ac-data, 3-57 - 58, 4-18 
flags, 3-26 
mem/reg-mem/reg, 
3-61-62, 
4-18 


mem/reg-data, 
3-59-60, 
4-18 


ADD 
ac-data, 3-63 - 64, 4-18 
flags, 3-26 
mem/reg-mem/reg, 
3-67 - 68, 4-18 


mem/reg-data, 
3-65 - 66, 4-18 


Address bus 
concepts, 7-10-16 
demultiplexed,7-10-11 
distributed 
demultiplexed, 
7-10 


local demultiplexed, 
7-10 


multiplexed, 
7-3 
Address calculation, 3-30- 31 


Index 


Addressing modes 
data addressing: 


base relative, 3-3, 3-37 - 39 
base relative direct indexed, 
3-39 


base relative direct stack, 3-40 
base relative implied, 3-38 
direct, 3-34 
direct indexed, 
3-35 
immediate, 
3-32 - 33 
implied, 3-36 
program addressing: 


program relative, 
3-31 
direct, 3-31 
indirect, 3-31 
ADRO-ADR13. See Multibus 
Algorithm, 
1-2 
ALD,7-28 
AND 
ac-data, 3-69-70, 
4-36 
flags, 3-28 
mem/reg-mem/reg, 
3-73 -74, 
4-36 


mem/reg-data, 
3-71 -72, 
4-36 
ASClI arithmetic examples 
addition, 4-20 
division, 4-28 
multiplication, 
4-25, 4-27 


Assembler, 
1-3 
definition of, 5-1 
functions of, 5-11 - 13 


BCD arithmetic examples 


addition, 4-20 
BCLK, 10-13. See also Multibus 
BHE, 7-4. See also Address bus 


usage in memory selection, 7-13-15 


BHE, 7-4. See also Address bus 
BHEN. See Multibus 
Binary arithmetic examples 


addition, 4-17 
subtraction, 
4-21 
32-bit multiply, 4-24 


BIU. See Bus Interface Unit 
Block control character, 
4-39 


Block move, 6-7 
BPRN, 10-7, 10-13. See also Multibus 
BPRO 10-13. See also Multibus 
BREQ, 10-5, 10-13. See also Multibus 
Breakpoint trap, 8-33 
Buffer, 5-4 
translation, 
4-13 
Buffer-to-buffer 
moves, 4-6-11 


Bus contention, 
7-20, 7-26-27 


Bus cycle, 7-8-10 
Bus Interface Unit (BIU), 3-2, 7-8 


activity with LOCK, 8-9 
operation of, 7-28-33 


Bus master, 
9-1 
Bus slave, 9-1 
BUSY, 10-7, 10-13 
Byte ordering in memory, 
4-17 


CALL,4-48 
addr, 3-75-76, 
4-50 


displ6, 
3-77-78, 
4-50 


flags, 3-26 
mem, 3-79-80, 
4-50 


mem/reg, 
3-81- 
82, 4-50 


CAS, 8-52 
Cascade address lines. See CAS 
CBRQ, 10-5, 10-13. See also Multibus 
CBW, 3-83, 4-30 
flags, 3-26 
CCLK. See Multibus 
Character pointer, 
5-4 


CLC, 3-84, 4-57 
flags, 3-28 
CLD, 3-85, 4-57 
flags, 3-28 
CLI, 3-86, 4-57 
flags, 3-28 
CLK. See 8284 timing signals 
CMC, 3-87, 4-57 
flags, 3-28 
CMP 
ac-data, 3-88 - 89, 4-32 


flags, 3-26 
mem/reg-mem/reg, 
3-92-93, 
4-32 


mem/reg-data, 
3-90-91, 
4-32 


CMPS, 3-94-95, 
4-45 


flags, 3-26 
Context switch, 4-14-15 
code for, 4-15 
execution time, 4-15 
Control information, 
1-6, 1-7 


Control Unit, 7-28 
Co-processor, 
10-1 
instructions, 
10-2- 3 
interface to 8086, 8-6, 10-2-3 


CPU, 1-2 
CWD, 3-96, 4-30 
flags, 3-26 


DAA,3-97-98 
flags, 3-27 
Daisy chain, 9·5, 9-9 
DAS, 3-99-100, 
4-23 
flags, 3-27 
Data bus 
buffered, 
7-16 
double buffering, 7-25 - 27 
local bus, 7-27 
multiplexed, 
7-3, 7-16-20 


system bus, 7-8, 7-27 
Data bus transceivers 
enable timing, 7-8 
Data memory, 
1-2 
Data movement, 
4-2 
Data translation, 
4-13-14 
translate buffer size, 4-13, 4-34 


DATO-DATF. See Multibus 
Debugger, 
1-11, 5-2 
DEC 


flags, 3-26 
mem/reg, 
3-101-102, 
4-22 


reg 3-103 -104, 
4-23 
Decoupling 
bulk, 9-7 
high frequency, 
9-7 
DIV, 3-105-106, 
4-29 
flags, 3-29 
Divide by zero, 8-32 
Documentation, 
1-9 
types, 1-13-14 


Editor, 5-1 
definition of, 5-3 
typical commands: 


Change String, 5-10 
Delete Data, 5-7 
Display, 5-8 
Insert Data, 5-6 
Move Character 
Pointer, 
5-8 


Read Data, 5-5 
Search Buffer, 5-9 
system commands, 
5-10- 
11 


Effective memory address, 
3-30 


8088 
address bus, 0-1 - 2 
addressing modes, 0-1 
bus cycle timing, 0-3 
control lines, 0-1 - 2 
data bus, 0-1 - 2 
instruction 
queue operation, 
0-3 


instruction 
set comparison 
with 8086, 0-4 


logic comparison 
with 8086, 0-4 


status decoding, 0-3 
status lines, 0-1 - 2 
8251A, 3-4-6 
character length, 3-5 
handshake 
protocol, 3-5, 3-12 


initialization, 
6-10 
status port definition, 
3-10 


8257 DMA, 8-57 
access to 8086 Bus, 8-57 
8-bit transfers, 
8-57 
16-bit transfers, 
8-62 
8259A 
cascade address transfer, 
8-38 - 40 


interface to 8086, 8-38 
master/slave 
configurations, 
8-38-42 


8282/8283 Octal latches, 10-5 


drive, 7-10 
operation, 
7-20 
timing, 7-10 
8284 
CLK buffering, 8-18 
clock delays, 8-14 
control lines, 8-11- 22, 10-5- 8 
crystal requirements, 
8-11 - 13 


debiasing capacitor, 8-12 
driving multiple 8284s, 8-14 
external frequency selection, 8-13 
input frequency requirements, 
8-11 


synchronizing 
multiple 8284s, 8-15 -17 


timing signals, 8-11-14, 
10-5 


8286/8287, Octal transceivers, 
7-20, 10-4 


capacitive loading, 7-20 
control, 7-20 
drive, 7-20 
timing, 7-20 
8288, 3-4 
command 
timing, 10-7 
control lines, 8-7-8, 
8-49-52, 


8-61,10-5,10-7,10-11 


interface with 8086, 8-3, 8-6 
operation 
in 
multiprocessor 
system, 
10-4-12 
resource mapping options, 
10-7-12 


SO, S1, S2 interface, 7-6 
8289, 10-4-13 
bus request options, 
10-13 
control lines, 10-5-13 
reset of, 8-21 
reset pulse width, 8-21 
response to reset, 8-21 


End-of-file record, 2-3, 2-7 
ESC, 3-107-108, 
4-57 
flags, 3-26 
ESCAPE, 8-6, 10-3 
EU. See Execution 
Unit 
Examine/Alter, 
1-11 
Execution Unit (EU), 3-2, 7-8 
effect of HOLD, 8-53 
operation of, 7-28-33 


Functional subsystems, 
10-4 


HALT, 
10-13 
HLT, 3-109, 4-57 
flags, 3-26 
Hold acknowledge, 
7-3-4 
HOLD/HLDA 
handshake, 
8-53 
bus conditioning, 
8-53 
bus exchange timing, 8-54 
latency, 8-55-56 


IDIV, 3-110-111, 
4-29 
flags, 3-29 
Idle clock, 7-32-33. 
See also T states 


IF. See Interrupt 
flag 
IMUL, 3-112-113,4-26 
flags, 3-27 
IN 
OX, 3-114-115, 
4-58 
flags, 3-26 
port, 3-116-117, 
4-58 
INC 
flags, 3-26 
mem/reg, 
3-118-119, 
4-18 
reg, 3-120-121, 
4-19 
INHI. See Multibus 
INH2. See Multibus 
INIT. See Multibus 
Instruction 
fetch 
logical operation, 
7-29 
Instruction 
queue, 7-6, 7-8 
operation of, 7-29- 33 
tracking operation of, 8-3-6 
INT, 3-122-123, 
4-61 
flags, 3-29 
INT nn, 8-33 
INTO-INT2. See Multibus 
INTA. See Multibus 
Interrupt acknowledge 
bus sequence, 
8-34, 8-36 
effect of loading SS, 8-35 
effect of LOCK prefix, 8-35 - 36 
effect of REP prefix, 8-35 - 36 
effect on BIU, 8-34 
latency, 8-36 
LOCK activity, 8-35 
stack activity, 8-37 
Interrupt enable flag, 7-3-4 
Interrupt flag, 8-33 
Interrupt 
priorities, 8-37 - 38 
Interrupt 
recognition, 
8-33 


Interrupt 
sequence, 
8-34 


Interrupt 
type number, 
8-30 


Interrupt 
vector, 8-30 


Interrupt 
vector table, 8-30, 8-34 


INTO, 3-124-125, 
4-61, 8-33 


flags, 3-29 
I/O channel, 
1-6-7 


I/O instructions, 
4-59 


register indirect addressing, 
4-59 


I/O interface, 
1-2 
10RC. See Multibus 
10WC. See Multibus 
IRET, 3-126, 4-61, 8-33 


flags, 3-29 


JA, 3-127, 4-52 


flags, 3-26 
JAE, 3-128,4-52 
flags, 3-26 
JB, 3-1294-52 
flags, 3-130, 3-26 


JBE,4-52 
flags, 3-26 
JC. See JB 
JCXZ, 3-131, 4-53, 4-55 
flags, 3-26 
JE, 3-132, 4-52 
flags, 3-26 
JG, 3-133, 4-52 
flags, 3-26 
JGE, 3-134, 4-52 
flags, 3-26 
JL, 3-134, 4-52 
flags, 3-26 
JLE, 3-136,4-52 
flags, 3-26 
JMP 
addr, 3-137-138, 
4-51 


conditional, 
4-56 
disp8, 3-139-140,4-51 
displ6, 
3-141-142, 
4-51 


flags, 3-26 
mem, 3-143-144,4-51 
mem/reg, 
3-145-146, 
4-51 


unconditional, 
4-49 


JNA. See JBE 
JNAE. See JB 
JNB. See JAE 
JNBE. See JA 
JNC. See JAE 
JNE, 4-53, 3-147 
flags, 3-26 
JNG. 
See JLE 
JNGE. See JL 
JNL. See JGE 
JNLE. See JG 
JNO, 3-148, 4-53 
flags, 3-26 
JNP, 3-149, 4-53 
flags, 3-26 


JNS, 3-151, 4-53 
flags, 3-26 
JNZ. See JNE 
10, 3-151, 4-53 
flags, 3-26 
JP, 3-152, 4-53 
flags, 3-26 
JPE. See JP 
JPO. See JNP 
JS, 3-153, 4-53 
flags, 3-26 
JZ. See JE 


Key, 2-3 


LAHF, 3-154-155, 
4-5 
flags, 3-26 
LDS, 3-156-157, 
4-4, 4-7 
flags, 3-26 
LEA, 3-158-159, 
4-4, 4-10 


flags, 3-26 
LES, 3-160-161, 
4-4 
flags, 3-26 
Linear change rate reset circuit, 8-22 
Linker, 5-2 
Listing file, 5-1 
Loader, 5-2 
LOCK, 3-162, 4-57 
flags, 3-26 
LOCK,8-8 
relationship to LOCK, 8-9 
usage, 8-10 
LODS, 3-163-164, 
4-45 
flags, 3-26 
LOOP, 3-165, 4-6, 4-17, 4-55 


flags, 3-26 
LOOP, 4-56 
usage, 6-11 - 12 
LOOPE, 3-166, 4-55 
flags, 3-26 
LOOPZ, 3-166, 4-55 
flags, 3-26 
LOOPNE, 3-167, 4-55 
flags, 3-26 
LOOPNZ, 3-167, 4-55 
flags, 3-26 
Loosely coupled, 9-11, 10-4 


Maximum 
mode, 
7-5, 9-1, 
10-1. See 


also MN/MX 
additional functionality, 
8-3 -10 


usage, 8-8 
Memory address space, 7-11-16 


bank selection, 7-11-16 
even addressed 
bytes, 7-13 


even addressed 
words, 7-14 


Odd addressed 
bytes, 7-14 


Odd addressed 
words, 7-15 


selection decoding, 7-4 
Minimum mode, 7-5. See also MN/MX 


bus preemption, 
8-1 


8086 signal description, 
8-1 


MN/MX, 
7-1, 7-4, 8-1 
signals affected by 7-5 
MOY 
flags, 3-26 
mem/reg-data, 
3-180-181, 
4-3 


mem/reg-mem/reg, 
3-168-169, 
4-3 


mem/reg-segreg, 
3-178-179, 
4-3 


reg-data, 3-170-172, 
4-3 


segreg-mem/reg, 
3-176 -177, 
4-3 


MOY 
even byte count, 6-2 - 3 
odd byte count, 6-2 
MOYS, 3-182-184, 
4-45 
flags, 3-26 
MRDC. 
See Multibus 
MUL, 3-185-186, 
4-26 
flags, 3-27 
Multibus 
bus exchange protocol, 9-8-10 
bus relinquish controls, 
9-11 


bus request controls, 
9-11 


8-bit data transfers, 
9-12 
interrupt 
structure, 
9-7 
pin assignments, 
9-1- 2 
priority arbitration, 
9-8 -10 


signal definitions: 
AACK,9-6 
ADRO-ADR13,9-3-4 
BCLK,9-4 
BHEN,9-4 
BPRN,9-5 
BPRO,9-5 
BREQ,9-5 
BUSY, 9-5 
CBRQ,9-5 
CCLK,9-4 
DATO-DATF,9-4 
INHl,9-4 
INH2,9-4 
INIT,9-3 
INTA,9-6 
INTO-INT7, 9-6 
10RC,9-6 
10WC,9-6 
MRDC,9-5 
MWTC,9-6 
XACK,9-6 
signal timing relationships, 
9-7 


16-bit data transfers, 
9-12 


transfer rates, 9-8 
use of BHEN, 9-12 
Multiplexed 
address data bus, 7-8. See also 


Multiplexed 
bus 
Multiplexed 
bus, 7-3. See also Address bus 


Multiprocessor 
systems, 9-1, 10-1, 10-4 


8086 configurations: 
local ROM/EPROM, 
10-8 


local 110, 10-11 
local RAM, 10-11 


multiple system busses, 10-11 
multiple local resources, 
10-5-7 
MWTC. See Multibus 


NEG, 3-187-188, 
4-23 
flags, 3-26 
NMI. See Non-maskable 
interrupt 
Non-maskable 
Interrupt, 
7-4, 8-32 
NOP, 3-189, 4-57 
Normally not ready, 8-24 
Normally ready, 8-24 
NOT, 3-190-191, 
4-36 
flags, 3-26 
Not Ready, 7-8 


Object code, 5-1 
relocatable, 
3-3 
Object code queue. See Instruction 
queue 


Object program, 
1-2-3. 
See also object code 


OE. See Output Enable 
OF. See Overflow flag 
Offset operator, 
6-2, 6-6 
OR 
ac-data, 3-192-193, 
4-37 
flags, 3-28 
mem/reg-mem/reg, 
3-196-197, 
4-36 


mem/reg-data, 
3-194-195, 
4-36 
OUT 
DX, 3-198-199, 
4-58 
flags, 3-26 
port, 3-200-201, 
4-58 
Output Enable, 7-16, 7-18 
timing for, 7-19 
Overflow flag, 8-33 


Parallel priority, 9-5, 9-8 - 9 
Parametet 
passing, 3-5, 4-12, 6-9 
Parameter 
tracking, 8-63 
POP 
flags, 3-26 
mem/reg, 
3-202 - 203, 4-5 
reg, 3-204- 
205, 4-5 
segreg, 3-206-207, 
4-5 
POPF, 3-208 - 209, 4-5 
flags, 3-29 
Ports 
data, 1-6-7 
status, 
1-6-7 


control, 
1-6-7 
Predefined interrupts, 
8-30 
Interrupt 
0, 8-32 
Interrupt 
1, 8-32 
Interrupt 
2, 8-32 
Interrupt 
3, 8-33 
Interrupt 
4, 8-33 
Prefix instruction, 
3-6 
Private resources, 
10-4 
Program 
general debugging techniques, 
1-11- 12 


general design techniques, 
1-9-10 


maintenance 
of, 1-14 


Program 
(Continued) 


module, 
1-8- 9 
specification of, 1-8- 9 


Program memory, 
1-2 


Program status word, 8-32 
PUSH 
flags, 3-26 
mem/reg, 
3-210-211, 
4-4 


reg, 3-212-213, 
4-4 


segreg, 3-214-215, 
4-4 


PUSHF, 3-216-217, 
4-4 


flags, 3-26 


Queue status 
QSO,7-6 
QSl, 7-6 
QSO, QS1 encoding, 
7-6, 8-3 


use with ICE 86, 8-3 


Queue tracker, 
8-3, 8-5 - 6 


RC reset circuit, 8-21-22 
RCL, 3-218-219, 
4-64 


flags, 3-28 
RCR, 3-220-221, 
4-65 


flags, 3-28 
Reentrant 
program, 
3-3-4 


Relocatable object code, 3-3 
Record number, 
2-3 


Register set, 3-20 
base register, 3-21 
count register, 3-22 
data segment register, 3-21 
8-bit, 3-21 
general purpose, 3-21 
I/O address register, 3-22 
I/O data register, 3-21 
16-bit, 3-21 
Release pulse. See RQ/GT protocol 
REP,3-222-223 
flags, 3-26 
Repeat prefix 
REP/REPE/REPZ, 
4-46 


RET, 3-224- 226, 4-50 


displ6, 
3-228-231, 
4-50 


flags, 3-26 
RETURN, 
4-49 
ROL, 3-231-232, 
4-66 


flags, 3-28 
ROR, 3-233 - 234, 4-67 


flags, 3-28 
Rotate instructions, 
4-62 


Routine call techniques, 
6-9 


RQ/GT protocol, 8-65 
RQ/GT timing, 8-65 
RQ/GT 
to 
HOLD/HLDA 
conversion, 
8-61-62 


SAHF, 3-235-236, 
4-5 


flags, 3-28 
SAR, 3-237 - 238, 4-69 


flags, 3-28 


SBB 
ac-data, 3-239- 
240, 4-22 


flags, 3-26 
mem/reg-data, 
3-241-242, 
4-22 


mem/reg-mem/reg, 
3-243 - 244, 4-22 


SCAS, 3-245-247, 
4-45 
flags, 3-26 
Segment override 
prefix, 3-247 - 248, 4-44 


Segment size, 3-22 
Serial priority, 9-8-9. 
See also Daisy chain 


Shared memory, 
10-4 
Shared resources, 
10-4 
Shell sort, 6-3 
Shift instructions, 
4-62 
SHL 
as used for multiply, 6-5 


SHLISAL, 
3-249- 
252, 4-68 


flags 3-28 
SHR, 3-252 - 253, 4-70 
flags, 3-28 
Signetics object code format, 4-39 
Single step, 1-11,3-24, 
8-32 


Software interrupts, 
4-60, 8-33 


Source code, 5-1 
Source file, 5-1 
Source 
program, 
1-3. See 
also 
Assembly 


language 
Source code 
Special function processor. See Co-proccesor 
Stack parameter 
passing, 4-9-10 


STC, 3-254, 4-57 
flags, 3-28 
STD, 3-255, 4-57 
flags, 3-28 
ST!, 3-256- 257, 4-57 
flags, 3-28 
STOS, 3-258 - 259 
4-45 
flags, 3-26 
String primitives, 
4-44, 4-46 


CMPS,4-31 
CMPSB,4-48 
CMPSW,4-48 
SCAS, 4-31, 4-33 
SUB 


ac-data, 3-260-261, 
4-22 


flags, 3-26 
mem/reg-data, 
3-262-263, 
4-22 


mem/reg-mem/reg, 
3-264- 265, 4-22 


Symbol file, 5-1 
System 
configurations, 
3-3 
general definition of, 1-1- 2 
maintenance, 
1-14 
specification of, 1-5- 8 
System guide. See Documentation 
System reset, 8-21. See also 8284 


TEST, 6-8 
ac-data, 3-266-267, 
4-37 


flags, 3-28 
mem/reg-data, 
3-268-269, 
4-37 


mem/reg-mem/reg, 
3-270-271,4-37 


TF. See Trap flag 
T states, 7-8-10 
Tl,7-8 
T2, 7-8 
T3, 7-8 
T4, 7-8 
TI, 7-10 
TW,7-8 


Tightly coupled, 9-11, 10-4 
Timing relationships, 
classes of, 8-63 


Trap flag, 8-32 


WAIT, 3-272, 4-57, 10-3 


flags, 3-26 
Wait state, 7-4, 7-8, 8-7, 8-24- 25 
Wait state generator, 
8-28 


XACK, 10-7-8. 
See also Multibus 


XCHG 


flags, 3-26 
reg-mem/reg, 
3-275 - 276, 4-3 
reg, 3-273 - 274, 4-4 
XLAT, 3-277 - 278, 4-4 
flags, 3-26 
XOR 


ac-data, 3-279-280, 
4-37 
flags, 3-28 
mem/reg-data, 
3-283 - 284, 4-37 


mem/reg-mem/reg, 
3-281-282, 
4-37 
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